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1.  Introduction 

This  is  the  final  report  of  a  two  year  study  on  the  use  of  fiizzy  pattern  recognition  of 
polygraph  data  for  the  identification  of  truth  versus  deception.  The  goals  of  this  study  as 
stated  in  the  original  proposal  where  to: 

1 .  develop  a  data  parsing  algorithm  which  will  process  polygraph  data  obtained  from  the 
NSA  into  three  domains;  time-domain,  frequency  domain,  and  correlation  domain; 

2.  design  a  fuzzy  classifier  algorithm  to  accept  the  featurized  data  and  modify  its 
membership  functions  based  on  the  error  between  its  classification  of  the  polygraph 
data  and  the  classification  in  the  NSA  files; 

3 .  study  relationship  between  number  of  membership  functions  an  the  success  of  the  data 
classification  and; 

4.  investigate  the  feasibility  of  the  classification  being  performed  in  a  near-real-time 
scenario. 

The  data  to  be  used  was  MGQT  polygraph  data.  However,  the  proposal  for  the  second 
year  of  the  study  introduced  the  goal  of  comparing  the  performance  of  the  developed 
fuzzy  classification  system  with  “zone  comparison”  polygraph  data.  Ultimately  this  was 
changed  to  be  the  simulated  “relevant  only”  data  obtained  from  DODPI. 

There  were  two  secondary  objectives  of  this  project.  First,  are  the  features  identified  as 
optimal  in  determining  the  veracity  of  a  subject  optimal  for  all  subjects.  Second,  are  there 
features  not  presently  being  used  in  polygraph  analysis  the  may  be  optimal. 

This  report  and  its  attached  appendices  will  show  that  all  objectives  of  the  original 
proposal  where  met.  A  fuzzy  parser  and  classifier  system  were  developed  that  could  run 
in  near  real-time,  achieve  performances  as  good  or  better  than  the  presently  available 
automatic  polygraph  systems,  and  identify  new  features  that  previously  where  not  used  in 
polygraph  classification.  Results  of  97%  correct  for  the  MGQT  data  and  100%  correct 
for  the  “relevant  only”  data  were  achieved.  It  will  be  shown  that  while  certain  features 
yield  good  identification  across  all  subjects,  a  clustering  algorithm,  fuzzy  C-means, 
developed  in  the  second  phase  of  this  work  identified  many  sets  of  features  that  probably 
should  be  tried  to  achieve  optimal  performance. 
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2.  Phase  1: 1993-1994 

The  first  phase  of  this  project  developed  a  complete  automatic  data  parsing  system  and 
fuzzy  pattern  recognition  system  based  on  the  fuzzy  k  nearest  neighbor  algorithm.  These 
two  elements  are  summarized  below. 

2.1  Development  of  Data  Parsing  Algorithm 

The  initial  goal  of  this  phase  was  to  be  able  to  read  the  MGQT  data  files  received  from  the 
NS  A  and  separate  this  data  into  appropriate  features  for  classification.  After  consulting 
with  the  University  of  Washington,  we  were  able  to  develop  our  own  data  reading 
program. 

After  consultation  with  experienced  polygraph  examiners  and  a  detailed  review  of  the 
polygraph  literature,  the  data  reading  program  was  then  modified  to  parse  the  data  into  a 
matrix  of  features.  The  feature  set  included,  as  outlined  in  the  project  proposal,  time 
domain,  frequency  domain,  and  correlation  domain  data.  Some  examples  of  the  feature  set 
are; 

Time  Domain  Features 

-  Mean,  curvelength,  area,  and  standard  deviation  for  all  polygraph  channels 

-  Average  of  the  amplitudes  of  the  peaks  in  the  cardio  and  respiratory  channels 

-  Derivative  of  the  amplitudes  of  the  peaks  of  cardio  and  respiratory  channels 

-  Number  of  peaks  in  the  cardio  and  respiratory  channels 

-  Inhalation  amplitude/exhalation  amplitude  of  respiratory  channels 

Frequency  Domain  Features 

-Fundamental  frequency  of  cardio  and  respiratory  signals 

-Coherency  and  cross  power  spectral  density  between  cardio  and  respiratory  channels 
-Power  spectral  density  of  cardio  and  respiratory  channels 
-Integrated  power  spectral  density  for  cardio  channel 

Correlation  Domain  Features 

-  Autoregressive  parameters  (10)  for  cardio  signal 

-  Cross-correlation  between  cardio  and  respiratory  channels 

In  order  to  classify  subjects  using  the  difference  between  control  and  relevant  responses, 
and  to  minimize  the  size  of  the  feature  vector,  the  features  were  combined  according  to 
the  following  method:  for  each  feature  /  (except  for  the  three  features  corresponding  to 
the  cross  power  spectral  density  and  integrated  spectral  difference)  from  each  subject  j 
compute: 
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1.  The  average  control  responses  AvgConty 

2.  The  average  relevant  responses  AvgRel^j 

3.  The  maximum  and  minimum  control  responses  MaxContyMinConty 

4.  The  maximum  and  minimum  relevant  responses  McpcRelyMinRely 

The  feature  vector  components  for  feature  i  are  then: 

1 .  Avg  Re  ly  -  AvgConty 

Avg  Re  4  -  AvgConty 

2.  - i - i- 

Avg  Re  4  +  AvgConty 

3 .  Max  Re  4  -  MaxConty 

4 .  Min  Re  4  -  MinConty 

5 .  Max  Re  ly  -  MinConty 

6.  Min  Re  4  -  MaxConty 

Max  Re  4 

7.  - ^ 

MaxConty 

For  the  three  features  mentioned  previously  that  cannot  be  combined  as  above  then  from 
each  subject  j  compute: 

1.  The  average  of  relevant-control  responses  Avg(RQlCont)y 

2.  The  maximum  of  relevant-control  responses  Max(^QlCont)y 

3.  The  minimum  of  relevant-control  responses  Min(RQlCont)y 

For  a  complete  description  of  this  method,  see  the  report  in  Appendix  B  entitled  Feature 
Analysis  of  the  Polygraph  by  Mitra  Dastmalchi. 

Ultimately  669  features  were  automatically  extracted  from  the  data.  The  complete  list  of 
all  669  features  used  in  this  project  are  shown  in  Table  1  in  the  report  in  Appendix  D 
entitled  Pattern  Recognition  of  the  Polygraph  Using  Fuzzy  Set  Theory.  The  use  of  this 
automatic  data  parsing  algorithm  is  described  in  more  detail  in  section  4.1. 

2.2  Design  of  Fuzzy  Classifier  Algorithm 

Fuzzy  classifier  design  first  focused  on  the  development  of  a  fuzzy  set  based  k  nearest 
neighbor  algorithm.  (This  work  is  described  in  detail  in  Appendix  C  entitled  Pattern 
Recognition  of  the  Polygrcq)h  Using  Fuzzy  Set  Theory  and  in  Pattern  Recognition  of  the 
Polygraph  Using  Fuzzy  Classification,  Proceedings  of  the  1994  IEEE  International 
Conference  on  Fuzzy  systems,  Vol  III,  pages  1825-1829.)  This  algorithm  is  a  supervised 
learning  algorithm  which  means  that  training  data  is  presented  to  the  algorithm  and  then 
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the  algorithm  is  “frozen”  and  test  data  is  presented.  Training  on  this  and  all  other 
algorithms  in  both  phases  of  the  study  was  always  performed  on  3/4  of  the  data  with 
testing  performed  on  the  remaining  1/4  of  the  data.  The  algorithm  learned  using  a  set 
of  MGQT  data  divided  equally  between  truthful  and  deceptive.  Since  there  were  150 
deceptive  files  and  only  50  truthful  files,  the  deceptive  files  were  divided  into  three  sets  of 
50  files  each.  When  a  question  was  asked  more  than  once  by  an  examiner  the  questions 
were  scored  individually  and  then  combined  at  the  end  on  a  majority  basis.  The  results  of 
this  work  are  summarized  collectively  in  section  4.2  below. 

3.  Phase  II:  1994-1995 

The  second  phase  of  this  project  dealt  with  creating  an  unsupervised  clustering  algorithm 
which  could  identify  important  features  more  rapidly,  creating  another  supervised  learning 
algorithm  to  determine  if  the  fuzzy  k-NN  algorithm  was  optimal  (fuzzy-LMS),  creating  a 
genetic  search  algorithm  to  try  to  aid  in  the  search  for  optimal  features,  and  expanding  the 
algorithm  testing  to  look  at  simulated  “relevant-only”  data  from  DODPI  in  addition  to  the 
MGQT  data.  These  elements  are  summarized  in  the  two  sections  below 

3.1  Comparison  of  the  Fuzzy  C-means,  Fuzzy  LMS,  and  Fuzzy  k- 
NN  Algorithm 

An  unsupervised  clustering  algorithm  was  created  to  visualize  which  features  allow  for 
larger  separation  in  the  truthful  and  deceptive  data  clusters.  In  addition,  a  supervised 
learning  algorithm,  fuzzy  LMS,  was  created  to  compare  with  fuzzy  C-means  and  fuzzy  k- 
NN.  (This  work  is  described  in  much  more  detail  in,  and  partially  excerpted  from. 
Appendix  D,  Use  of  Fuzzy  Set  Classification  for  Pattern  Recognition  of  the  Polygraph, 
and  in  Classification  of  Deception  Using  Fuzzy  Pattern  Recognition,  Psychophysiology, 
Volume  31,  Supp.l,  August  1994.) 

The  fuzzy  LMS  system  is  unique  in  its  application  of  linguistic  knowledge.  The  use  of 
linguistic  knowledge  ensures  the  robustness  of  the  fuzzy  system.  The  use  of  linguistic 
information  also  ameliorates  the  problem  of  not  having  enough  reliable  numerical  data. 
Unlike  classification  schemes  such  as  the  K-Nearest  Neighbor,  the  fuzzy  LMS  algorithm  is 
not  entirely  dependent  on  numerical  data. 

When  applied  to  pattern  recognition,  fiizzy  logic  systems  can  be  set  up  to  perform  like 
KNN  systems.  In  KNN  systems,  numerical  data  of  known  class  patterns  are  set  up  to 
estimate  the  probability  density  distribution  of  the  classes.  The  probabilities  of  new  data 
points  belonging  to  the  different  classes  are  then  computed  based  on  such  distribution. 
Data  points  around  known  class  samples  are  then  classified  into  the  same  class  with  a 
higher  probability.  The  flizzy-KNN  algorithm  modifies  the  classical  KNN  algorithm  by 
taldng  into  account  the  distance  between  the  data  point  and  the  known  class  patterns  when 
estimating  the  probability.  Conceptually  this  is  similar  to  setting  up  clusters  around  all 
known  class  samples  and  calculating  the  degree  of  belonging  of  new  data  points  in  the 
different  types  of  clusters.  Other  than  the  exact  mathematical  equations,  that  description 
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fits  a  fuzzy  adaptive  system  where  each  rule  corresponds  to  a  known  class  pattern  and  the 
size  of  the  clusters  is  the  same  for  all  rules. 

However,  fiizzy  adaptive  systems  give  up  some  of  the  nice  theoretical  understandings  of 
the  KNN  systems  but  gain  some  practical  advantages.  The  number  of  rules  required  are 
usually  much  smaller  than  the  number  of  known  samples.  Fuzzy  logic  can  usually  exploit 
that  to  reduce  system  complexity. 

Furthermore,  the  system  complexity  for  a  fuzzy  adaptive  system  stays  the  same  even  as 
new  information  are  available.  This  is  partly  a  result  of  the  way  this  algorithm  adapt 
continuously;  new  information  are  learned  as  old  ones  are  forgotten.  The  fiizzy  LMS 
learning  technique  is  like  backpropagation,  a  popular  neural  network  training  techmque. 
However,  the  fiizzy  LMS  learning  algorithm  requires  few  epochs  for  training.  In  all  our 
trials  the  maximum  recognition  rates  for  testing  data  peaked  in  less  than  thirty  epochs. 
About  95%  of  them  peaked  in  less  than  twenty  epochsl.  This  is  a  few  orders  of  magnitude 
less  than  most  applications  of  backpropagation.  In  many  cases  the  peaks  occurred  before 
any  training;  that  is,  the  system  uses  only  linguistic  rules.  Here  the  use  of  expert 
knowledge  speeds  up  the  training  of  the  system. 

The  fuzzy-c-means  algorithm,  unlike  fiizzy  LMS,  is  an  unsupervised  clustering  algorithm. 
Given  a  set  of  data,  FCM  looks  for  a  (usually)  predetermined  number  of  clusters  within 
the  data  points.  It  does  not  use  any  knowledge  about  the  correct,  or  desired  classification 
of  any  of  the  elements.  The  algorithm  only  minimizes  an  objective  function,  which  is  the 
sum  of  a  function  of  the  data  points'  membership  values  and  the  distances  between  the 
data  points  and  the  clusters'  centers. 

FCM  operates  like  a  black  box;  given  some  data,  the  algorithm  automatically  computes 
the  results2.  This  presents  the  advantage  that  different  sets  of  data  using  different  features 
can  be  tested  in  a  routine  manner.  FCM  also  presents  a  way  to  normalize  the  different 
dimensions  of  the  data,  just  like  the  use  of  sigma  in  the  fuzzy  LMS  algorithm.  However, 
unlike  fiizzy  LMS,  FCM  does  not  present  a  method  to  find  the  optimal  way  for  such 
normalization. 

The  fiizzy  LMS  algorithm,  however,  does  pose  some  potential  problems  of  its  own.  The 
use  of  expert  knowledge,  while  a  benefit  in  some  senses,  may  not  be  always 
straightforward.  For  example,  in  our  project  we  did  not  have  any  specific  knowledge 
about  the  polygraphy  itself.  Whatever  we  learned,  we  learned  by  looking  at  numerical 
data.  As  we  tried  to  find  more  complicated  patterns,  patterns  involving  three,  four,  or 
more  features,  the  analysis  became  more  difficult.  Naturally  one  wishes  to  automate  this 
process.  If  we  do  not  rely  on  some  learning  procedures,  however,  rules  cannot  be 
automatically  found  for  the  fiizzy  system.  Much  research  also  needs  to  be  done  to 
understand  the  fiizzy  LMS  algorithm's  learning  dynamics.  While  the  same  method, 
gradient  descent,  is  used  on  both  backpropagation  and  the  fiizzy  LMS  algorithm,  the 
general  shapes  of  the  error  surface  between  the  two  are  different.  In  backpropagation,  all 


'However,  we  ran  every  trial  to  forty  epochs  to  ensure  that  there  is  no  "false"  peak. 
^Our  job  is  basically  to  adjust  the  parameters. 
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the  parameters  have  the  same  range  and  lie  in  an  uniform  neural  network  structure.  In  the 
fiizzy  LMS  algorithm,  the  parameters  can  have  different  ranges  and  lie  a  fuzzy  logic 
structure  that  is  not  completely  uniform.  The  effects  of  such  differences  on  the  shape  of 
the  error  surface  and  the  learning  dynamic  are  unknown. 

A  summary  of  the  data  comparing  these  methods  is  presented  in  section  4.2  below.  All 
MGQT  data  was  processed  as  was  summarized  in  section  2.2  above. 

3.2  Fuzzy  C-means  Algorithm  on  “Relevant  Only”  Data 

The  data  parsing  algorithm  was  extensively  modified  to  process  the  relevant  only  data. 

This  data  was  composed  of  166  truthful  and  166  deceptive  tests  with  no  irrelevant 
questions  asked.  Thus  the  seven  techniques  of  data  combination  described  in  section  2. 1 
could  not  be  used.  Instead,  four  combinations  were  used  as  follows: 

1.  Avg(Feature) 

2.  Max(Feature)-Mm(Feature) 

3.  Max(Feature)  / Min(Feattire) 

4.  Std(Feature) 

Also,  these  files  were  in  an  entirely  different  data  format  which  head  to  be  interpreted  for 
data  parsing.  (See  Appendix  E  for  a  summary  of  incorrect  data  formats  from  the  “relevant 
only”  data.) 

4.  Summary  of  Results 

The  results  for  the  entire  project  are  summarized  below.  First,  the  complete  automatic 
data  analysis  package  is  summarized  including  data  parsing  and  classification.  Second, 
comparison  of  accuracies  amongst  the  different  methods  for  both  MGQT  and  “relevant 
only”  polygraph  data  is  presented. 

4.1  Automatic  Data  Analysis  Method 

Below  is  a  description  of  the  automatic  data  parsing  and  classification  technique 
developed  in  this  project.  Refer  to  Appendices  A-D  for  a  more  complete  description. 

4.1.1  Parsing  the  Data 

4. 1. 1. 1  Reading  the  Data 

It  should  be  noted  that  the  data  reading  methods  are  only  important  for  “off-line” 
processing  and  would  not  be  used  for  near  real-time  applications. 

The  data  was  collected  in  three  phases  labeled  ERS-1,  ERS-2  and  ERS-3.  Each  polygraph 
test  may  consist  of  one  to  five  charts  with  each  chart  consisting  of  three  files.  Each  chart  is 
a  series  of  questions,  usually  ten  questions.  The  files  are  given  in  DOS  file  format  and 
must  be  read  and  decoded  before  they  can  be  seen. 
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The  following  files  comprise  a  chart: 

$$EACOWO.011 

$$EACOWO.021 

$$EACOWO.OS1 

Each  of  these  three  files  has  a  specific  significance.  The  .XX3  files  are  text  files  which 
contain  the  questions  which  the  subjects  were  asked.  The  .XXI  and  .XX2  files  are 
encoded  in  a  specific  format  created  by  Axciton  polygraph  testing  devices.  These  files  can 
be  decoded  by  a  program  entitled  reads.  Reads  can  be  invoked  in  DOS  as  in  the  following 
example: 

reads  $$EACOWO.011  outputl 
reads  $$EACOWO.021  output 2 
reads  $$EACOWO.OS1  outputs 

The  reads  command  decodes  the  data  in  files  X.Ol  1,  X.021  and  X.OS  1  and  writes  them  in 
ASCII  files  entitled  outputl,  output2  and  output  S,  respectively.  Output2  and  outputs 
contain  the  actual  signals  from  four  polygraph  channels  with  a  timing  signal  which  shows 
the  times  when  the  questions  were  asked.  The  output  files  were  labeled  such  that  minimal 
confusion  was  allowed.  For  example,  the  output  file  for  non-deceptive  subject  45,  text  file 
.XXS  compiled  during  phase  ERS-1  reads: 

nd45tS.exl 

4. 1. 1. 2  Feature  Extraction 

After  the  polygraph  files  are  decoded  and  written  into  output  files,  they  can  be  processed 
in  MATLAB.  MATLAB  is  a  commercially  available  mathematical  analysis  program  which 
runs  on  a  PC,  Macintosh,  and  most  UNIX  platforms.  The  feature  extraction  process 
consists  of  a  MATLAB  program  which  extracts  features  for  all  files  and  saves  them  in  a 
matrix  consisting  of  subjects  and  features.  The  main  feature  extraction  program  is  a 
MATLAB  routine  called  Do.M.  This  program  extracts  the  pre-selected  52  features,  from 
each  subject,  contained  in  the  variable  feature  list.  Feature_list  is  a  MATLAB  matrix 
which  includes  the  names  of  the  feature  extraction  routines.  In  each  row  of  the  featurejist 
matrix,  a  feature  extraction  routine  is  named  along  with  the  channel  number(s)  this  routine 
will  be  applied  to.  The  mean,  standard  deviation,  maximum  subtracted  from  the  minimum 
and  the  maximum  divided  by  the  minimum  is  taken  of  the  extracted  features.  These  four 
results  are  put  into  a  matrix  which  is  then  put  into  a  larger  matrix  called  xlO.mat, 
consisting  of  all  non-deceptive  and  deceptive  subjects  and  all  52  features  from  the  feature 
list. 

4.1.2  Classifying  the  Data 

After  the  data  is  parsed  in  DOS  and  MATLAB,  the  classifying  process  takes  place  entirely 
in  MATLAB. 
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4. 1. 2. 1  K-Nearest  Neighbor  Algorithm 

The  main  program  which  runs  the  KNN  algorithm  is  called  fknn  which  is  written  in  the  C 
programming  language.  This  file  interacts  with  MATLAB  by  reading  and  writing  files  in 
MATLAB  format,  that  is  .mat  files.  This  algorithm  is  implemented  by  the  program  fknn 
which  opens  a  MATLAB  data  file,  reads  the  training  matrix,  classifies  each  entry  in  the 
testing  matrix  and  writes  the  result  in  an  output  file.  The  file  from  which  this  program 
receives  information  fi'om  is  “fdatafile.mat”  which  is  in  MATLAB  file  format. 

Because  the  KNN  algorithm  has  been  automated,  it  can  be  run  in  only  a  few  simple  steps. 
For  a  complete  description  of  this  process  see  Appendix  C.  Before  running  the  algorithm  a 
few  variables  must  be  determined.  For  example,  for  the  “relevant  only”  data; 

1.  A  single  variable  ‘C’,  the  number  of  classes  was  set  equal  to  two  for  deceptive  and 
non-deceptive. 

2.  A  single  variable  ‘K’,  determines  how  many  different  points  surrounding  a  chosen 
point  will  be  compared  to  it  and  classified.  The  parameter  ‘K’  in  the  K-NN  algorithm 
was  varied  from  one  to  ten  throughout  the  simulations. 

3.  A  single  variable  ‘M’,  the  coefficient  in  the  fuzzy  algorithm  was  set  equal  to  two. 

4.  A  training  matrix  ‘P’,  contains  a  set  of  feature  vectors.  Each  vector  is  a  column  of  the 
matrix.  There  were  fifty  deceptive  and  fifty  non-deceptive  tests  used  for  training.  The 
combination  of  features  to  be  tested  is  also  entered  in  this  matrix. 

5.  A  class  membership  matrix ‘T’,  which  contains  the  membership  values  of  the  training 
set  vectors  to  the  classes.  This  matrix  was  set  such  that  a  one  was  displayed  for  a  non- 
deceptive  detection  and  a  zero  for  a  deceptive  detection. 

6.  An  input  matrix  ‘U’,  which  contains  a  set  of  unclassified  feature  vectors  contained  the 
rest  of  the  tests  not  used  for  training.  These  remaining  tests  make  up  the  testing 
matrix.  The  same  combination  of  features  entered  in  ‘P’  are  to  be  entered  in  the  ‘U’ 
matrix. 

7.  Threshold  which  is  varied  from  0.2  to  0.8  throughout  the  simulations. 

Once  the  matrix  XlO.mat  is  loaded  in  MATLAB,  the  KNN  algorithm  can  be  invoked  by 
simply  typing  “KNN”.  The  user  will  then  be  asked  to  enter  a  numerical  value  for  the  K 
parameter  in  the  K-NN  algorithm.  Parameters  chosen  between  one  and  ten  have  been 
found  to  produce  the  best  results.  Once  the  k  parameter  has  been  entered,  the  number  of 
correct  deceptive  and  non-deceptive  detections  can  be  obtained  by  entering  the  following: 

sum(fresult(l,l:116)>0.5)  non-deceptive 

sum(fresult( 1,11 7:232)<0. 5)  deceptive 

The  correct  detection  for  non-deceptive  data  is  shown  by  a  one,  so  the  threshold  is  greater 
than  0.5.  The  percent  correct  for  the  deceptive  data  can  be  obtained  by  dividing  the 
number  of  correct  deceptive  detections  by  166.  This  same  process  works  for  the  non- 
deceptive  data.  Finally,  the  total  correct  detection  percent  is  obtained  by  taking  the 
average  of  the  two  percentages. 
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4. 1.2. 2  Fuzzy  C-Means 

The  Fuzzy  C-Means  algorithm  for  MGQT  data  has  been  made  user  friendly  through 
automated  push  buttons  written  in  MATLAB  (see  Figure  1).  These  buttons  allow  the  user 
to  execute  the  feature  extraction  and  classification  process  without  an  understanding  of 
the  complexity  of  each  program  used  in  the  algorithm.  With  minor  modifications,  the  push 
buttons  can  be  used  for  the  “relevant  only”  data  as  well. 


Figure  1:  User  Interface  for  Fuzzy  C-Means  Clustering  Algorithm 

Before  running  the  algorithm  a  few  variables  must  be  determined.  For  example,  for  the 
“relevant  only”  data: 

1.  The  ‘temp’  matrix  in  the  fc_means  program  was  set  equal  to  the  dimensions  (1,332). 

2.  The  threshold  was  varied  from  0.2  to  0.8  for  each  different  simulation  that  was  run. 

3.  Combination  of  features  to  be  tested  can  be  changed  as  described  below. 

The  following  execution  process  is  necessary  only  if  the  push  button  automation  is  not 
used.  After  the  matrix  XlO.mat  is  loaded,  the  user  must  type  the  following  to  run  the 
algorithm: 

[Uikz]  =  fc_means(5, 0.000005, xI0([8  23  24],:)) 

The  z  parameter  is  the  number  of  iterations  made  by  the  algorithm  to  obtain  the  results 
and  Uik  is  the  membership  values.  To  calculate  the  correct  detection  of  non-deceptive  and 
deceptive  subjects,  the  user  must  type  the  following: 
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sum(Uik(l,  1:166)<0.5)  non-deceptive 
sum(Uik(l,  11 7:332)>0.5)  deceptive 

where  0.5  is  the  selected  threshold  for  this  particular  simulation.  The  percent  correct  for 
each  class  can  be  determined  by  dividing  the  number  correct  by  the  total  number.  The  total 
percent  correct  is  then  obtained  by  averaging  the  two  percentages. 

4. 1. 2. 3  Least  Mean  Squares  Algorithm 

The  LMS  fiizzy  adaptive  filter  is  a  nonlinear  adaptive  filter  which  makes  use  of  both 
linguistic  and  numerical  information  concerning  the  physical  characteristics  of  the 
polygraph  data  in  their  natural  form.  This  filter  is  constructed  from  a  set  of  changeable 
fiizzy  IF-THEN  rules.  We  have  the  choice  of  setting  the  rules  according  to  our 
experiences  and  incorporating  them  directly  into  the  filter,  or  initializing  the  rules 
arbitrarily.  Before  running  the  algorithm  a  few  variables  must  be  determined.  For  example, 
for  the  “relevant  only”  data: 

1.  The  number  of  training  subjects  was  set  equal  to  100. 

2.  The  ‘running  time’,  how  often  the  algorithm  goes  through  the  data,  was  set  to  70. 

3.  Different  combinations  of  the  features  was  changed  manually  for  each  different 
simulation. 

After  the  matrix  XlO.mat  is  loaded,  the  user  must  simply  type: 

ImstesLm 

The  total  percent  correct  of  deceptive  and  non-deceptive  data  is  automatically  displayed 
under  the  variable  ‘maximum’. 

4.2  Classification  Accuracy 

4.2.1  MGQT 

Figure  2  shows  a  comparison  of  the  best  results  for  each  of  the  classification  algorithms 
found  in  this  study.  (See  Appendix  D  for  a  more  complete  description  of  how  this 
comparison  was  performed.)  It  should  be  noted  that  the  optimum  features  found  for  the 
fiizzy  c-means  and  the  fiizzy  k-NN  algorithms  were  different.  This  is  important  because  it 
means  that  if  both  algorithms  were  run  on  a  given  subject,  there  results  could  be 
independent  and  corroboratory.  The  fiizzy  LMS  algorithm  was  simply  run  using  the 
optimal  four  features  found  for  the  fiizzy  c-means  algorithm.  The  method  number  refers  to 
the  7  combination  methods  described  in  section  2. 1  above.  The  three  data  files  refer  to  the 
fact  that  the  150  deceptive  files  were  separated  into  three  files  of  50  and  compared  to  the 
50  non-deceptive  files. 
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Features  Used 

Data  Set  1 

Data  Set  2 

Data  Set  3 

Fuzzy  C-Means 

Ampl  of  Peaks(High  Freq 
Cardio)  Meth.  4, 

Max-Min(High  Freq  Cardio) 
Meth.  7, 

Std(GSR)  Meth.  2, 

Std(GSR)  Meth.  4 

93 

87 

97 

Fuzzy  k-NN 

Max(GSR)  Meth.  1, 

Max(Lower  Resp)  Meth.  6, 

Max(Upper  Resp)  Meth.  3, 

Max-Min(High  Freq  Cardio) 
Meth.  4 

86 

80 

91 

LMS  Fuzzy 

Ampl  of  Peaks(High  Freq 
Cardio)  Meth.  4, 

Max-Min(High  Freq  Cardio) 
Meth.  7, 

Std(GSR)  Meth.  2, 

Std(GSR)  Meth.  4 

81 

83 

83 

Figure  2:  Comparison  of  Different  Classification  Techniques  ofMGQT  Data  (in  percent 
correct) 


4.2.2  “Relevant  Only” 

For  the  relevant  only  data  the  fuzzy  c-means  algorithm  was  used  since  it  achieved  the  best 
performance  for  the  MGQT  data.  Figure  3  shows  the  summary  of  results  for  different 
combinations  of  the  four  optimal  features  described  in  Figure  2  above. 
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Feature(s) 


[2  4] 


[2  34]  IN:  100  |N:  96  |N:  78  |N:  51  |N:  11  |N:0.6  |N:  3 


D:0  ID:  2  |D: 28  |D: 58  |D:92  |D: 99  |D:  97 


[2  39] 


N:  100 


D:  2 


[4  34] 


N:  20 


D:  78 


[4  39] 


N:  61 


D:  37 


[34  39] 


N:  100 


D:  0.6 


N:  90 


D:  16 


[2  4  34] 


N:  100 


D:  0 


IN:  49 


D:  58 


N:  0.6 


P:  99 


[2  4  39] 


N:  29 

N:  24 

N:  34 

D:  45 

D:  48 

D:  54 

N:  0 


D:  24  ID:  1 


N:5  IN: 33  |N: 77  |N: 99  |N: 100 


D:5  p:32  D:  71  D:  99  |D:  100 


N:45  |N:34  |N:  3  |N:22  |N:23  |N:4 


|D:24  ID:  30  p:  6  p:  65  p:  54  p:  5  p:  66 


[2  4  34  39]|N:100  IN: 99  IN: 95  IN: 67  IN: 23  IN:  1  IN:  54 

dI'O  p!'0  |d:  5  Id:  33  |d:  71  |d:  99  |D:  33 


Figure  3:  Classification  of  "Relevant  Only"  Data  Using  the  Fuzzy  C-Means  Algorithm 
and  Different  Combinations  of  the  Four  Features  Given  in  Figure  2 
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Note  that  for  the  combination  of  three  features,  2,  34,  39  (which  correspond  to  Std(GSR) 

Meth.  2,  AmpI  of  Peaks(High  Freq  Cardio)  Meth.  4,  Max-Min(High  Freq  Cardio)  Meth. 

7)  a  score  of  100%  correct  for  both  deceptive  and  non-deceptive  was  achieved. 

5.  Conclusions 

This  project  achieved  all  goals  set  in  the  phase  1  and  phase  2  proposals: 

1 .  a  data  parsing  algorithm  was  developed  which  will  process  polygraph  data  obtained 
from  the  NSA  into  three  domains:  time-domain,  frequency  domain,  and  correlation 
domain; 

2.  several  frizzy  classifier  algorithms  were  designed  to  accept  the  featurized  data  and 
modify  its  membership  functions  based  on  the  error  between  its  classification  of  the 
polygraph  data  and  the  classification  in  the  NSA  files; 

3.  relationships  were  found  between  number  of  membership  functions  an  the  success  of 
the  data  classification  up  to  four  simultaneous  features; 

4.  the  feasibility  of  the  classification  being  performed  in  a  near-real-time  scenario  was 
shown;  and 

5.  near  perfect  scores  were  achieved  for  both  MGQT  and  “relevant  only”  data  without 
allowing  for  any  “don’t  know”  results. 
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Appendix  A:  Time  Domain  Features  for  the  Fuzzy 
Ciassification  of  PolyGraph  Data 
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Abstract 


A  polygraph  examination  is  the  most  popular  method  used  to  determine  if  an 
individual  is  being  truthful  or  deceptive.  During  an  examination,  a  subject  is  asked  a  series 
of  questions  and  the  physiological  responses  to  the  questions  are  recorded  using  a 
polygraph.  The  three  physical  responses  currently  obtained  from  a  polygraph 
examinations  are  blood  pressure,  respiration,  and  skin  conductivity.  Polygraph  charts  are 
usually  analyzed  by  a  human  interpreter  for  evidence  of  truth  or  deception;  however, 
computer  algorithms  are  now  being  used  to  verify  results  [1][2]. 

In  this  project,  the  K  nearest  neighbor  algorithm  was  used  to  determine  truth  or 
deception.  By  using  this  adaptive  fuzzy  system,  it  was  possible  for  the  computer 
evaluation  of  the  polygraph  to  adapt  to  individual  differences  in  the  physiological 
responses.  Two  algorithms  were  necessary  for  this  project.  The  first  was  a  parsing 
algorithm  which  preprocessed  polygraph  data  and  extracted  features  from  it.  These 
features  can  be  separated  into  three  domains:  time  domain,  frequency  domain,  and 
correlation  domain.  The  second  was  the  K  nearest  neighbor  fuzzy  classifier  which 
analyzed  the  data  from  the  parsing  algorithm  and  determined  the  possibility  of  deception. 
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1.1  History 


The  first  attempt  to  use  a  scientific  instrument  in  an  effort  to  detect  deception 
occurred  around  1 895  [3],  That  was  the  year  that  Cesar  Lombroso  published  the  results 
of  his  experiments  in  which  a  hydrosphygmograph  was  used  to  measure  the  blood 
pressure-pulse  changes  of  criminals  in  order  to  determine  whether  or  not  they  were 
deceptive.  Although  the  hydrosphygmograph  was  originally  intended  to  be  used  for 
medical  purposes,  Lombroso  found  that  it  worked  well  for  lie  detection.  Lombroso  may 
have  been  the  first  to  use  a  peak  of  tension  test  format.  This  was  done  by  showing  a 
suspect  a  series  of  photographs  of  children,  one  being  the  victim  of  sexual  assault.  If  the 
suspect  did  not  react  more  to  the  victims  picture  than  the  pictures  of  the  other  children, 
Lombroso  concluded  that  the  suspect  did  not  know  what  the  victim  looked  like  and 
therefore  was  not  the  alleged  perpetrator. 

In  1914  Vittorio  Benussi  published  his  research  on  predicting  deception  by 
measuring  recorded  respiration  tracings  [4].  He  found  that  if  the  length  of  inspiration 
were  divide  by  the  length  of  expiration,  the  ratio  would  be  larger  after  lying  than  before 
lying  and  also  before  telling  the  truth  than  after  telling  the  truth.  In  1921  John  A.  Larson 
constructed  an  instrument  capable  of  simultaneously  recording  blood  pressure  pulse  and 
respiration  during  an  examination  [3][4].  Larson  reported  accurate  results  which 
prompted  Leonarde  Keeler  to  construct  a  better  version  of  this  instrument  in  1926  [3][4]. 

The  use  of  galvanic  skin  response  in  lie  detection  began  during  the  turn  of  the 
century.  It's  usefulness,  however,  did  not  become  evident  until  the  1930's  during  which 
time  several  articles  written  by  Father  Walter  G.  Summers  of  Fordham  University  in  New 
York  [4].  In  these  articles  he  reports  over  90  criminal  cases  in  which  examination  using 
the  galvanic  skin  response  had  all  been  successful  and  confirmed  by  confession  or 
supplementary  evidence.  The  usefulness  of  the  galvanic  skin  response  prompted  Keeler 
to  add  an  galvanometer  to  his  polygraph.  At  the  time  of  Keelers  death  in  1949,  the  Keeler 
Polygraph  recorded  blood  pressure-pulse,  respiration,  and  galvanic  skin  response  [3]. 

1.2  Modern  Test  Formats 

The  effectiveness  of  a  polygraph  examination  is  often  the  result  of  the  test  format 
that  is  used.  A  polygraph  test  format  consists  of  an  ordered  combination  of  relevant 
questions  about  an  issue,  control  questions  that  provide  a  physical  response  for 
comparison,  and  irrelevant  questions  that  also  provide  a  response  or  the  lack  of  a  response 
for  comparison  [1][4].  Three  general  types  of  test  formats  are  in  use  today.  These  are 
Control  Question  Tests,  Relevant-Irrelevant  Tests,  and  Concealed  Knowledge  Tests. 

Each  of  the  general  test  formats  may  have  a  number  of  more  specific  variations.  Each  test 
consists  of  two  to  five  charts  containing  a  prescribed  series  of  questions.  The  test  format 
that  is  used  in  an  examination  is  determined  by  the  test  objective  [3][4]. 

The  concealed  knowledge  test,  also  called  peak  of  tension  test,  is  used  when  facts 
about  a  crime  are  known  only  by  the  investigators  and  not  by  the  public.  In  this  case,  a 
subject  would  not  know  the  facts  unless  he  or  she  was  guilty  of  the  crime.  For  example, 
if  a  gun  was  used  in  a  crime  and  the  public  did  not  know  the  caliber,  an  examiner  could 
ask  a  suspect  if  it  was  a  22  caliber ,  a  38  caliber,  or  a  9mm.  If  the  gun  used  was  a  9mm 
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and  the  suspect  was  deceptive,  a  polygraph  chart  would  probably  indicate  evidence  of 
deception. 

A  control  question  test  is  often  used  in  criminal  investigations.  In  this  type  of  test 
a  series  of  relevant,  irrelevant,  and  control  questions  are  asked.  A  relevant  question  is  one 
which  is  specific  to  the  crime  being  investigated.  For  example, "  Did  you  molest  the 
child?".  A  control  question  is  designed  to  make  the  subject  feel  uncomfortable.  It  is  not 
specific  to  the  crime  being  investigated  however  it  may  be  related  in  an  indirect  way.  A 
control  question  that  could  follow  the  relevant  question  stated  above  is  "Have  you  ever 
forced  yourself  on  another  person  sexually  ?".  The  control  questions  are  compared  to  the 
relevant  questions  and  if  the  responses  to  the  relevant  questions  are  greater,  the  subject  is 
usually  classified  as  deceptive.  Irrelevant  questions  are  used  as  bufiers.  Examples  of 
irrelevant  questions  are  "Are  the  lights  in  this  room  on?"  or  "Is  today  Monday?". 

Relevant-Irrelevant  tests  are  usually  used  to  test  people  trying  to  obtain  security 
clearance  or  get  a  job.  In  this  test,  relevant  questions  are  compared  to  irrelevant 
questions.  Very  few  control  questions  are  asked.  The  purpose  of  control  questions  in  this 
test  is  to  make  sure  that  the  subject  is  capable  of  reacting  at  all. 

1.3  Present  Day  Equipment 

The  most  popular  polygraph  machines  today  are  the  Reid  Polygraph  developed  in 
1945  and  the  Axciton  Systems  computerized  polygraph  developed  in  1989  [1][1 1].  The 
Reid  polygraph  scrolls  a  piece  of  paper  under  pens  that  record  the  biological  signals.  The 
Axciton  polygraph  digitizes  physiological  signals  and  uses  a  computer  to  process  them. 
The  sampling  frequency  of  the  Axciton  machine  is  30  Hz.  Axciton  provides  a  computer 
based  system  for  ranking  the  subject  responses  but  allows  printouts  of  the  charts  to  be 
scored  by  hand  the  traditional  way.  The  Axciton  and  Reid  polygraphs  are  shown  in 
figures  1  and  2  respectively. 

Both  machines  record  the  same  biological  signals  using  standard  methods.  Blood 
pressure  is  measured  by  placing  a  standard  blood  pressure  cuff  on  the  arm  over  the 
brachial  artery.  Respiration  is  monitored  by  placing  rubber  tubes  around  the  abdominal 
area  and  the  chest  of  the  subject.  This  results  in  two  signals,  an  upper  and  lower 
respiratory  signal.  Skin  conductivity  is  measured  by  placing  electrodes  on  two  fingers  of 
the  same  hand. 
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Figure  1  Axciton  Polygraph  [1] 


Figure  2  Reid  Polygraph  [3] 
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2.1  Fuzzy  Set  Theory 


In  1965  fuzzy  sets  were  introduced  by  Lofti  Zadeh  [5][6].  They  provided  a  new 
way  to  represent  vagueness  and  made  description  of  many  situations  much  easier.  For 
example,  it  is  not  practical  to  say  that  all  temperatures  below  72  degrees  Farenheit  are 
cold  and  all  temperatures  above  are  hot.  Instead,  temperatures  between  50  and  72  would 
by  described  as  cool,  temperatures  between  30  and  50  would  be  considered  cold,  and 
anything  below  30  would  be  very  cold.  One  way  to  describe  this  situation  is  through  the 
use  of  fiizzy  set  theory.  In  fuzzy  set  theory  an  element  is  not  defined  as  belon^ng  or  not 
belonging  to  a  given  set.  Instead,  it  has  a  degree  of  membership  in  a  set  which  is 
characterized  by  a  compatibility  function  uA  [6]  [7],  The  compatibility  function,  also 
called  a  membership  function,  states  the  degree  of  membership  in  a  set  "A"  and  has  a 
range  [0,1].  An  illustration  of  how  this  applies  to  the  temperature  example  above  is 
illustrated  in  figure  1  and  described  below. 


30  72  100 

Figure  3  Compatibility  functions  ucold(T)  and  uhot(T)  vs.  temperature. 

Here,  and  u^,(T)  are  the  degrees  of  membership  in  each  set  and  T  is  the 

temperature  in  Farenheit.  Figure  1  shows  that  the  temperatures  around  72  degrees  have 
membership  in  u^y(T )  and  Uf^,(T).  These  memberships  have  values  around  .5  which 
represents  cool  or  warm.  As  the  cooler  temperatures  decrease,  u^(T)  increases  thus 
representing  a  colder  situation.  Once  the  temperatures  become  less  than  30  degrees, 
u^y(T )  obtains  a  membership  value  of  1  which  indicates  very  cold  temperatures. 

Fuzzy  set  theory  is  often  thought  of  as  another  form  of  probability  theory.  In 
actuality,  the  two  are  very  different  [8].  In  Bayesian  probability  theory,  elements  either 
belong  or  do  not  belong  to  a  given  set,  and  a  probability  density  function  determines  the 
likelihood.  For  example,  a  light  may  be  either  on  or  oflT  and  the  probability  of  either  event 
occurring  will  depend  on  some  statistical  parameters  ( Is  the  room  occupied?  Is  it  dark 
out?  etc.).  The  following  is  an  example  of  the  difference  between  fuzzy  logic  and 
Bayesian  probability  theory  [6]. 
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Example  1 


Let  L  =  set  of  all  liquids,  and  let  fuzzy  subset  1  =  {all  (potable)  liquids}. 
Suppose  you  had  been  in  the  desert  for  a  week  without  drink  and  you  came  upon  two 
bottles  marked  "C  and  "A"  as  in  figure  4a. 


Figure  4a  Liquids  before  observation 


Confronted  with  this  pair  of  bottles,  and  given  that  you  must  drink  from  the  one 
that  you  choose,  which  would  you  choose  to  drink  fi-om?  Most  readers,  when  presented 
with  this  experiment,  immediately  see  that  while  "C"  could  contain,  say,  swamp  water,  it 
would  not  (discounting  the  possibility  of  a  Machiavellian  fuzzy  modeler)  contain  liquids 
such  as  hydrochloric  acid.  That  is,  membership  of  0.91  means  that  the  contents  of  "C"  are 
fairly  similar  to  perfectly  potable  liquids  (e.g.,  pure  water).  On  the  other  hand,  the 
probability  that "  A"  is  potable  =  0.91  means  that  over  a  long  run  of  experiments,  the 
contents  of  A  are  expected  to  be  potable  in  about  91%  of  the  trials;  in  the  other  9%  the 
contents  will  be  deadly  -  about  1  chance  in  10.  Thus,  most  subjects  will  opt  for  a  chance 
to  drink  swamp  water. 

There  is  another  facet  to  this  example,  and  it  concerns  the  idea  of  observationion. 
Continuing  then,  suppose  that  we  examine  the  contents  of  "C"  and  "A"  and  discover  them 
to  be  as  shown  in  figure  4b.  Note  that,  after  observation,  the  membership  value  for  "C"  is 
,  unchanged  while  the  probability  value  for  A  drops  from  0.91  to  0.0. 
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Figure  4b  Liquids  after  observation 


This  example  shows  that  these  two  models  possess  philosophically  different  kinds 
of  information:  memberships,  which  represent  similarities  of  objects  to  imprecisely 

defined  properties,  and  probabilities,  which  convey  information  about  relative  frequencies. 


3.1  MGQT 


The  test  format  used  in  this  project  was  the  MGQT  test  format.  It  is  a  type  of 
control  question  test  in  which  relevant,  irrelevant,  and  control  questions  are  asked  in  the 
order  given  in  table  1  [9]  [12],  Before  each  test,  the  questions  that  will  be  asked  are 
discussed  with  the  subject.  The  series  of  questions  is  asked  three  times  in  the  order 
specified  in  table  1 .  This  produces  three  test  charts.  The  examiner  waits  about  20 
seconds  between  each  question. 

Not  all  of  the  Axciton  charts  used  in  this  study  follow  the  format  of  table  1  exactly. 
Many  examiners  rearranged  the  order  in  which  the  questions  were  asked.  All  polygraph 
charts  used,  however,  were  variations  of  this  test.  For  example,  one  examiner  used  a  test 
format  in  which  questions  3  and  4  were  switched.  Many  of  the  examiners  changed  the 
order  in  which  the  questions  were  asked  in  the  second  and  third  charts. 


Table  1  MGQT  question  format 


4.1  File  Formats 

Axciton  files,  digitized  polygraph  data  fi-om  the  axciton  polygraph,  were  obtained 
from  the  National  Security  Agency  (NSA)  in  standard  MSDOS  format.  The  sampling 
fi-equency  of  the  data  was  30Hz.  Each  test  consisted  of  nine  files.  The  labling  of  the  files 
is  shown  in  table  2  and  the  purpose  of  each  file  is  explaned  below. 


Chart  1 

SSxxxxxx.Oll 

$$xxxxxx.012 

$$xxxxxx.013 


Chart! 

$$xxxxxx.021 

$$xxxxxx.022 

$$xxxxxx.023 


Chart  3 

$Sxxxxxx.03 1 
$$xxxxxx.032 
$$xxxxxx.033 


Table  2  File  format 
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As  stated  in  the  section  above,  each  examination  is  composed  of  three  charts.  The 
chart  number  is  specified  by  the  second  number  after  the  period.  The  third  number  after 
the  period  represents  the  type  of  file, 

$$xxxxxx  .Oxl  is  the  event  marker  file  which  contains  the  length  of  the  chart  and 
the  event  markers.  The  start  and  end  of  an  examiners  question  is  marked  with  a  0  and  1, 
respectively.  The  beginning  of  the  subjects  response  is  indicated  with  a  2  and  the  rest  of 
the  file  is  marked  with  9's.  File  $$xxxxxx.0x2  is  the  file  containing  the  biological  signals. 
These  signals  correspond  to  the  marker  file.  File  $$xxxxxx.0x3  contains  the  questions  and 
labels  them  relevant,  irrelevant,  or  control. 

An  ASCII  file  of  five  columns  is  created  by  using  SSxxxxxx.Oxl  and  $$xxxxxx.0x2 
and  a  program  provided  by  the  NSA.  An  example  of  this  file  along  with  a  description  of 
the  function  of  each  file  is  shown  in  table  3  [12], 


Event  Marker 

FileChart  Data 

FileQuestion  TextFile 

SSxxxxxx.Oxl 

$$xxxxxx.0x2 

$$xxxxxx.0x3 

Aiciton 

File 

Contains  the  length  of 
the  chart,  the  number 
of  channels,  and  the 
position  of  the  e%^ent 
marker. 

Contains  the  digitized 
series  values  formatted 
according  to  flags  in  the 
Event  Marker  File. 

Contains  the  script  of 
of  questions  or  a 
shorthand  script  of 
questions. 

Processing 

Notes 

Becomes  the  5th 
column  of  ASCII  file, 
0=start  of  a  question 
l=end  of  a  question 
2=start  of  response 
9=No  Event  Marker 

Becomes  lst-4lh  columns 
of  ASCII  file. 

Column  1-GSR 

Column  2-Cardio 

Column  3 -Upper  Resp 
Column  4-Lower  Resp 

Files  used  to 
determine  deviations 
from  standard  test 
format. 

ASCn  File  Format  (with  column  labels) 


File  Row  GSR  Cardio 

UR 

LR 

EvMark 

DOS 

1 

1983 

1931 

1482 

1083 

9 

File 

2 

1983 

1922 

1483 

1084 

9 

3 

1983 

1913 

1483 

1084 

9 

4 

1983 

1906 

1483 

1085 

9 

Table  3  File  description  and  example 
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5.1  Preprocessing 


MATLAB  was  used  to  display  the  signals  and  implement  all  of  the  filters  and 
feature  extraction  algorithms.  First,  the  four  biological  signals  were  processed  into  six 
channels.  Hamming  windowed  FIR  filters  were  used  to  create  these  channels  and 
eliminate  noise.  A  low  fi-equency  cardiovascular  channel  was  produced  by  lowpass 
filtering  the  cardiovascular  signal  at  .5  Hz  using  a  134  tap  lowpass  filter.  Then,  a  high 
fi’equency  cardiovascular  channel  was  produced  by  highpass  filtering  the  cardiovascular 
signal  at  .5  Hz  using  a  134  tap  highpass  filter.  The  derivative  of  the  low  frequency 
channel  was  then  used  to  create  a  third  channel.  To  eliminate  noise,  the  upper  and  lower 
respiratory  signals  were  lowpass  filtered  at  1.2  Hz  using  a  160  tap  filter.  Noise  was 
eliminated  from  the  galvanic  skin  response  by  using  a  100  tap  lowpass  filter  with  a  cutoff 
frequency  of  .5  Hz.  Any  DC  trends  that  existed  within  a  chart  were  eliminated  using  the 
detrend  function  in  MATLAB.  This  function  finds  the  best  straight  line  fit  to  the  data  and 
then  subtracts  the  line  from  the  data.  Each  signal  was  normalized  by  dividing  by  its 
standard  deviation.  The  raw  data  and  results  of  this  processing  are  shown  in  figures  5-14. 

Fragments  of  each  signal  were  accessed  before  features  were  extracted.  These 
fragments  were  successfully  used  by  Brian  M.  Duston  of  the  Naval  Control  and  Ocean 
Surveillance  Center  in  his  study  and  are  given  in  table  4  [9].  The  start  and  end  points 
given  in  table  4  refer  to  the  time  elapsed  after  the  question  was  asked  by  the  examiner. 


Channel 

Start 

End 

GSR 

2  sec. 

14  sec. 

Upper  respiratory 

2  sec. 

18  sec. 

Lower  respiratory 

2  sec. 

18  sec. 

Low  frequency  cardiovascular 

2  sec. 

18  sec. 

High  frequency  cardiovascular 

3  sec. 

9  sec. 

Derivative  of  low  frequency  cardiovascular 

0  sec. 

8  sec. 

Table  4  Time  fragments  used  in  feature  extraction 
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Figure  11  Lower  Respiratory 


Figure  12  Preprocessed  Lower  Respiratory 
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5.2  Time  Domain  Feature  Extraction 


Many  of  the  time  domain  features  were  chosen  by  talking  to  examiners  and  finding 
out  what  was  important  to  them  in  an  examination  [10][1 1].  One  feature  examiners  use  to 
determine  deception  involves  the  height  of  the  peaks  in  the  respiratory  signal.  If  the  peaks 
become  smaller  or  staircase  during  a  relevant  question  there  is  a  good  chance  that  the 
subject  is  being  deceptive.  From  looking  at  different  polygraph  charts  it  could  be  seen  that 
indiv’idual  reactions  may  vary  slightly  with  time.  For  this  reason,  many  features  were 
extracted  from  the  respiratory  channels  in  order  to  determine  if  the  deceptive 
characteristics  described  above  may  be  present.  One  feature  extracted  fi'om  the 
respiratory  signal  was  the  average  height  of  the  peaks.  Because  the  time  fi'agments  fi'om 
which  the  features  are  extracted  remain  constant,  this  feature  may  not  give  good  results 
for  subjects  reacting  early  or  late.  For  this  reason,  the  minimum  peak  height  was  also  used 
as  a  feature. 

To  try  and  capture  the  effect  of  staircasing,  the  average  of  the  derivative  of  the 
amplitudes  of  the  peaks  was  used  as  feature.  To  compensate  for  early  and  late  reactions, 
the  maximum  of  the  derivative  of  the  amplitudes  of  the  peaks  was  also  used  as  a  feature. 

Another  respiratory  feature  used  in  this  project  was  the  curve  length.  This  feature 
was  successfully  used  and  researched  by  Howard  Timm  in  the  early  1980's[10][13]. 
Interest  in  curve  length  lead  to  curiosity  about  the  area  under  the  respiratory  curve.  For 
this  reason  it  w'as  also  extracted  to  see  if  it  could  be  used  as  a  feature.  Because  people 
tend  to  breath  quicker  when  they  are  stressed  or  nervous,  the  number  of  peaks  produced 
during  a  given  period  of  time  was  used  as  a  feature. 

Because  it  was  one  of  the  first  features  used  to  successfully  determine  deception, 
Benussi's  I/E  ratio  was  tested  [3]  [4].  Benussi's  method  requires  that  the  I/E  ratio  of  the 
subject  is  calculated  before  and  after  the  examiner  asks  a  question.  The  value  of  the  I/E 
ratio  calulated  after  the  question  is  asked  is  then  divided  by  the  value  of  the  I/E  ratio 
before  the  question  is  asked.  According  to  Benussi's  findings,  if  the  ratio  is  greater  than 
one,  the  subject  is  deceptive.  In  an  attempt  to  reduce  the  number  of  computations 
required  for  Benussi's  method,  a  modification  of  Benussi's  feature  was  tested.  In  the 
modification  of  Benussi's  test,  the  ratio  was  taken  only  after  the  question  was  asked  and 
w'as  not  compared  to  the  subjects  I/E  ratio  before  the  question  was  asked. 

The  examiners  we  spoke  to  would  usually  try  to  find  evidence  of  deception  in 
respiratory-  signals  first.  If  a  subject  did  not  show  a  strong  respiratory  response  however, 
the  examiner  would  analyze  the  subjects  cardiovascular  response.  Because  a  subjects 
heart  rate  will  often  increase  when  deceptive,  the  number  of  peaks  in  the  high  frequency 
cardiovascular  signal  was  used  as  a  feature.  From  looking  at  many  charts,  it  became 
evident  that  some  of  the  processing  used  in  extracting  features  from  the  respiratory 
channels  would  also  be  useful  in  determining  deception  fi'om  the  high  frequency 
cardiovascular  channel.  For  this  reason,  the  average  of  the  peak  height,  minimum  of  the 
peak  height  and  curve  length  were  extracted  fi'om  the  high  fi-equency  cardiovascular 
channel  in  order  to  determine  if  they  would  be  useful  features. 

Many  of  the  standard  statistcal  features  used  in  other  computerized  polygraph 
algorithms  were  also  examined  [9].  These  features  included  the  mean,  the  standard 
deviation,  the  maximum  amplitude,  and  the  minimum  amplitude  of  the  signal.  Variations 


15 


of  these  such  as  the  minimum  subtracted  from  the  maximum  were  also  examined. 
Although  the  original  use  of  the  curve  length  and  area  was  to  determine  deception  from 
the  respiratory  channel,  it  was  extracted  from  the  GSR  and  cardiovascular  channels  as 
well.  It  was  not  possible  from  looking  at  the  signals  to  determine  if  the  curve  length  had 
changed,  but  almost  any  change  in  a  signal  would  affect  this  feature.  A  list  of  the  features 
extracted  from  each  channel  are  given  in  table  5.  The  programs  used  to  extract  these 
features  were  written  in  MATLAB  and  are  included  in  the  appendix  of  this  report. 


High  freouencv  cardiovascular 

1)  mean  of  signal 

2)  standard  deviation  of  signal 

3)  minimum  value  of  signal 

4)  maximum  value  of  signal 

5)  curve  length  of  signal 

6)  area  under  signal 

7)  average  amplitude  of  peaks 

8)  minimum  amplitude  of  peaks 

9)  derivative  of  the  amplitudes  of 
the  peaks  in  the  signal 

10)  number  of  peaks  in  the  signal 

1 1)  minimum  subtracted  from  maximum 

Low  frequency  cardiovascular 

1)  mean  of  signal 

2)  standard  deviation  of  signal 

3)  minimum  value  of  signal 

4)  maximum  value  of  signal 

5)  curve  length  of  signal 

6)  area  under  signal 

7)  minimum  subtracted  from 
maximum 

Upper  and  lower  respiratory 

1)  mean  of  signal 

2)  standard  deviation  of  signal 

3)  minimum  value  of  signal 

4)  maximum  value  of  signal 

5)  curve  length  of  signal 

6)  area  under  signal 

7)  average  amplitude  of  peaks 

8)  minimum  amplitude  of  peaks 


QSR 

1)  mean  of  signal 

2)  standard  deviation  of  signal 

3)  minimum  value  of  signal 

4)  maximum  value  of  signal 

5)  curve  length  of  signal 

6)  area  under  signal 

7)  minimum  subtracted  from 
maximum 


Derivative  of  low  frequency 

1)  mean  of  signal 

2)  standard  deviation  ofsignal 

3)  minimum  value  of  signal 

4)  maximum  alue  of  signal 

5)  curve  length  of  signal 

6)  area  under  signal 

7)  minimum  subtracted  from 
maximum 


9)  derivative  of  the  amplitudes  of 
the  peaks  in  the  signal 

10)  number  of  peaks  in  the  signal 

1 1)  inhalation/exhilation  ratio 

12)  ratio  of  inhalation  ratios  before 
and  afrer  a  question  is  asked 

13)  minimum  subtracted  from 
maximum 


Table  5  List  of  time  domain  features 
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5.3  Feature  Extraction  Methods 


To  extract  the  follov*ing  features  which  are  listed  in  table  5,  (respiratory  7,  8,9,10 
,  1 1  and  high  frequeny  cardiovascular  7,  8,  9),  it  was  necessary  to  locate  the  peaks  of  the 
respiratory  and  the  high  frequency  cardiovascular  signals.  This  was  not  a  trivial  task 
because  these  signals  contained  low  amplitude  high  frequency  noise  which  was  diflBcult  to 
eliminate  without  distorting  the  data  (see  figures  8,10,  and  12).  In  order  to  find  the  useful 
peaks,  two  programs  were  written.  The  program  that  found  the  peaks  of  the  respiratory 
signal  was  titled  peaklr  and  the  program  that  found  the  peaks  in  the  cardiovascular  signal 
titled  peakcard.  Both  programs  can  be  found  in  the  appendix.  The  way  that  these 
programs  find  peaks  is  as  follows;  The  second  derivative  was  taken  and  points  that  had 
values  equal  to  zero  were  labeled  as  peaks.  The  amplitudes  of  the  signal  at  points  near 
these  peaks  were  evaluated  and  the  maximum  of  these  values  were  labeled  as  peaks. 

In  order  to  eliminate  the  effects  of  the  low  amplitude  high  frequency  noise,  it  was 
necessary  to  check  the  amplitude  of  data  points  that  were  near  each  point  that  had  been 
labeled  as  a  peak.  The  number  of  the  data  points  from  the  peaks  that  were  determined  by 
the  second  derivative  was  chosen  by  examining  many  respiratory  and  cardiovascular 
signals  and  determining  the  average  width  of  the  peaks  in  these  signals.  It  was  found  that 
twenty  points  on  each  side  of  the  each  peak  found  by  the  second  derivative  was  a 
satisfactory  range  for  the  respiratory  signals.  Similarly  eight  points  on  each  side  of  the 
initial  peak  gave  would  satisfy  this  criterion  for  the  cardiovascular  signal.  All  of  the 
routines  used  to  perform  these  operations  are  in  appendix  B  (see  peak.m,  peakcard.m,  and 
peaklr.  m). 

In  order  to  determine  the  I/E  ratio,  it  was  necessary  to  find  the  valleys  of  the 
respiratory  signals  as  well  as  the  peaks.  The  method  used  to  find  the  valleys  was  the  same 
as  that  used  to  find  the  peaks  (see  appendix  B  valley.m  and  valleylr.m).  The  I/E  ratio  was 
found  by  the  following  method.  First  the  time  that  a  valley  occurred  was  subtracted  from 
the  time  that  a  peak  occurred.  Then  the  time  that  the  peak  occurred  was  subtracted  from 
the  time  that  the  next  valley  occurred.  The  first  value  was  then  divided  by  the  second 
value  (see  appendix  B  ie.m  and  ieie.m). 


6.1  Conclusion 

A  vector  of  features  was  created  by  the  program  featurev.m  which  first  executed 
all  of  the  preprocessing  routines.  The  program  then  extracted  features  for  all  of  the 
questions  using  the  times  specified  in  table  4.  This  program  extracted  features  from  all 
polygraph  files  in  a  directory  and  produced  a  set  of  vectors.  These  vectors  were  then  used 
for  training  and  testing  of  a  fiizzy  K  nearest  neighbor  classifier.  For  details  on  the 
methods  used  for  training  and  testing  as  well  as  the  frequency  and  correlation  domain 
features  used  in  the  study  refer  to  Dastmalchi  [14],  For  details  on  the  K  nearest  neighbor 
algorithm  refer  to  Layeghi  [15]. 
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Appendix  A 


Preprocessing  Programs 


DERCD.M 


function  y  =  dercd(var) 

%  This  extracts  the  derivative  of  a  lowpass 
%  filtered  version  of  the  cardio  signal. 

% 

%  To  use  this  command  the  user  must  enter  the  file  name 
% 

%  eg.  dercd (variable  name) 

q  =  detlc(var);  %  detrends  the  lower  frequencies 

%  of  the  cardio  signal 

e  =  diff (q) ;  %  differentiates  the  lower 

%  frequencies  of  the  cardio  signal 

X  =  e/std (e) ; 
y  =  [x* ,x( length (X) ) ' ] ' ; 


Page  1 


d(>  &P  &(> 


DETGSR.M 


function  y  =  detgsr(var) 

This  function  detrends  the  gsr 

To  use  this  command  the  user  must  enter  the  file  name 
eg.  detgsr(file  name) 
dtrnd  =  detrend (var( 1) ) ; 

%  elliminates  dc  trends  in  signal 
%  eg.  a  line  added  to  the  signal 

window  =  100; 

dtrnd  =  [dtrnd',  zeros (window/ 2  -  1,1)']'; 

%  adds  zeros  to  end  of  signal  so  that  no 
%  information  is  lost  during  filter  delay 


b  =  firl (window, . 03) ; 

X  =  f ilter (b, 1, dtrnd) ; 
q  =  x/std(x) ; 

1  =  length (q); 

y  =  q (window/2 : 1) ;  %  compensate  for  time  delay 
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DETHIC.M 


function  y  =  dethic(var) 

%  This  function  detrendeds  the  high  frequencies 
%  of  the  cardio  signal. 

% 

%  To  use  this  command  the  user  must  enter  the  file  name 
% 

%  eg.  dethic(file  name) 

dtrnd  =  detrend(var ( : ,2) ) ;  %  elliminates  dc  trends  in  signal 

%  eg.  a  line  added  to  the  signal 


window  =  134; 

dtrnd  =  [dtrnd*,  zeros (window/2  -  1,1)']'; 

%  adds  zeros  to  end  of  signal  so  that  no 
%  information  is  lost  during  filter  delay 


b  =  fin  (window, .  035, 'high' ) ; 

%  filter  to  elliminate  low  frequencies 

X  =  f ilter (b, 1, dtrnd) ; 
q  =  x/std(x) ; 

1  =  length (q); 

y  =  q (window/2 : 1) ;  %  compensate  for  time  delay 
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DETLC.M 


function  y  =  detlc(var) 


%  This  function  extracts  and  detrends  the  low 
%  frequencies  of  the  cardio  signal 
% 

%  To  use  this  command  the  user  must  enter  the  file  name 
% 

%  eg.  detlc(file  name) 


dtrnd  =  detrend(var ( : ,2) ) ; 
window  =  134; 


%  elliminates  dc  trends  in  signal 
%  eg.  a  line  added  to  the  signal 


dtrnd  =  [dtrnd',  zeros (window/2  -  1,1)']'; 

%  adds  zeros  to  end  of  signal  so  that  no 
%  information  is  lost  during  filter  delay 

b  =  fin  (window,  .035)  ;  %  filter  to  elliroinate  high  frequencies 

X  =  filter (b, 1, dtrnd) ; 
q  =  x/std(x) ; 


1  =  length (q); 

y  =  q(window/2 : 1) ;  %  compensate  for  time  delay 
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DETLR.M 


function  y  =  detlr(var) 

This  function  extracts  and  detrends  the  lower  respiratory  signal 
To  use  this  command  the  user  must  enter  the  file  name 


eg.  detltr(file  name) 


dtrnd  =  detrend(var ( : ,4) ) ;  %  elliminates  dc  trends  in  signal 

%  eg.  a  line  added  to  the  signal 


window  =  240; 


dtrnd  =  [dtrnd',  zeros (window/ 2  -  1,1)']'; 

%  adds  zeros  to  end  of  signal  so  that  no 
%  information  is  lost  during  filter  delay 


b  =  fin  (window,  .083) ;  %  filter  to  elliminate  noise 

X  =  f ilter (b, 1, dtrnd) ; 
q  =  x/std(x) ; 

1  =  length (q); 

y  =  q (window/2 : 1) ;  %  compensate  for  time  delay 
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DETUR.M 


function  y  =  detur(var) 


%  This  function  detrends  the  upper  respiratory  signal 
% 

%  To  use  this  command  the  user  must  enter  the  file 
% 

%  eg.  detur(file  name) 


dtrnd  =  detrend ( var (: ,3) ) ; 
window  =  240; 


%  elliminates  dc  trends  in  signal 
%  eg.  a  line  added  to  the  signal 


dtrnd  =  [dtrnd',  zeros (window/ 2  -  1,1)']'; 

%  adds  zeros  to  end  of  signal  so  that  no 
%  information  is  lost  during  filter  delay 


b  =  fin  (window,  .08)  ;  %  filter  to  elliminate  noise 

X  =  filter (b, 1, dtrnd) ; 
q  =  x/std(x) ; 

1  =  length (q); 

y  =  q(window/2:l) ;  %  compensate  for  time  delay 
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Appendix  B 


Feature  Extraction  Programs 


function  [x,y,z]  =  featurev(file_name,relevant,iiielevant,conlrol, features) 

%  This  function  produces  a  feature  vector  for  a  given  file 
%  RelcN'ent,  irrelev  ent,  and  control  are  vectors  which  contain 
%  the  questions  these  features  are  extracted  fi-om. 

% 

%  eg.  featurev(t79,[3  5],[1  4],  [6  10],fcature_list) 

%  The  above  example  gives  the  features  for 
%  the  file  t79  of  the  3rd  and  5th  question  which  are  relevent  in  this 
%  MGQT  format,  the  1st  and  4th  question  which  are  irrelevent 
%  and  the  6th  and  10th  questions  which  are  control 

%  feature_list=[TOmean(frag  ) 

%  '20curve(frag )’; 

%  '30area(frag ) ']; 


feature_list  =  features 


%  The  channels  are  ordered  as  follows: 

%  1:GSR,  2;HiC::ardio,  3:LowCardio,  4:DerLowCardio,  5;LowResp,  6:UpResp 


%  This  is  a  matrix  of  the  time  delay  after  asking  a  question  to  start  of  extracting 
%  the  feature,  and  finish  extracting  the  feature  for  each  channel. 

Times=[2,  14; 

3.9; 

2,  18; 

0.8; 

2,18; 

2,  18]; 

%  These  are  preprocessing  functions. 

Preprocess=[  'detgsr'; 

•dethic*; 

•detlc  •; 

'dercd '; 

•detlr*; 

'detur  ’]; 

data=zeros(6,length(file_name(:,5))); 

%  Standardize  and  detrend  the  channels  and  derive  new  channels 
for  i=l:6, 

data(i,:)=eval([Preprocess(i,:),’(file_name)l)’; 


end 


marker  =  file_name(:,5);  %  0  begin  test  and  end  test 
%  0  examiner  begins  asking  question 
%  1  examiner  finishes  asking  question 
%  2  subject  begins  response  to  question 
%  9  does  not  mark  an  event 

begin  =  find(marker  =  0);  %  finds  indecies  where  marker  =  0  (question  begins) 

begin=begin(2:length(begin));  %  elliminates  the  marker  at  the  ^ginning  of  the  test 


%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 
%  This  for  loop  creates  feature  veaors  for  each  relevant  quesion 
% 

%  eg  X  =  [mean(gsr),std(gsr),area(gsr),mean(lr),std(lr),area(lr),etc . 

%  curv'e  length, amplitude  of  peaks,#  of  peaks] 

%%%%Vo%%%%%%VoVo%yo%yo%%%Vo%%%%%%%%%yoVo%%%%%%%%%%%%%%%%%yo 


feature_count=l; 

for  i  =  l:length(relevant), 

queslion=relevant(i); 

for  j=l  :length(feature_list(;,  1)) 

channel_number=e\’a](feature_list(j,  1 )); 
second_channel=e\'al(feature_list(j,2)); 
st=begin(question)+30*Times(chaiuiel_number,  1 ); 
fn=begin(question)+30*Times(channel_number,2); 
st2=begin(question)-30*Times(chaimel_number,2); 
fn2=begin(question)-30*Times(channel_number,l); 
fi=feature_list(j,3  :length(feature_list(  1 ,:))); 
fTag=data(channel_number,st;fh); 
frag2  =  data(channel_number,st2:fii2); 
if  second_channel  0 

st3=begin(question)+30*Times(second_chaimel,l); 

fti3=^)egin(question)+30*Times(second_chaimel,2); 

fTag3  =  data(second_channel,st3:fii3); 
end 

tempj^walCfr); 
for  m  =  l:length(tempy) 

x(feature_count)  =  tempy(m); 

feature_count=feature_count+l; 

end 

end 

end 

%  Irrelevant  questions 

feature_count=l; 

for  i  =  l;length(irrelevant), 

question=irrele\’ant(i) ; 

for  j= I  :length(feature_list(:,  1 )) 

channel_number=e\’al(feature_list(j,  1 )); 


second_channel“eval(feature_list(j,2)); 
st==begin(question)+30*Times(chaiinel_number,l); 
fh=begin(question)+30*Times(channel_number,2); 
st2=begin(question)-30*Times(channel_number,2); 
fh2=begin(question)-30*Times(channel_number,l); 
fr=feature_list(j,3:length(feature_list(l,:))); 
frag=<lata(channel_number,st;fii); 
frag2  =  data(channel_number,st2:fii2); 
if  second_channel  -=  0 

st3=begin(question)+30*Times(second_channel,  1); 
fii3=begin(question)+30*Times(second_chaimeii2); 

firag3  =  data(second_chanDeI,st3:fia3); 
end 

temp>’=cval(fr); 
for  m  =  1  :length(tempy) 
y(feature_count)  =  tcmpy(in); 

feature_coimt=feature_count+ 1 ; 
end 
end 


o/o. - 

%  Control  questions 

feature_cx)unt=l; 

for  i  =  l:length(control), 

question=control(i); 

for  j= 1  :length(feature_Iist(:,  1 )) 

channel_number=eval(feature_list(j,  1 )); 
second_channel=eval(feature_list(j,2)); 
st=begin(question)+30*Times(channel_number,  1 ); 
fn=begin(question)+30*Times(channel_nuinber,2); 
st2=begin(question)-30*Times(channel_number,2); 
fn2=begin(question)-30*Times(channel_number,  1 ); 
fT=feature_list(j,3  :length(feature_list(  1 ,;))); 
frag=data(channel_nuinber,st:fii); 
frag2  =  data(channel_number,st2:fit2); 
if  second_channel  ~=  0 

st3=^)egin(question)+30*Times(second_channel,l); 

fii3=begin(question)+30*Tinics(second_channel,2); 

frag3  =  data(second_channel,st3:fii3); 
end 

temp>'=eval(fr); 
for  m  =  1  :length(tempy) 

z(feature_counl)  =  teinpy(m); 

feature_count=feature_count+l ; 
end 
end 


AMPCARD.M 


function  y  =  ampcard(var) 

%  This  function  finds  the  average  of  the  amplitudes 
%  of  the  peaks  in  the  high 

%  cardio  signal  over  a  specified  period  of  time. 

% 

%  To  use  this  command  the  user  must  enter  the 
%  file  name  and  the  start  and  finish  points 
%  of  the  signal  to  be  displayed 

%  eg.  ampcard (variable  name) 

p  =  peakcard(var) ;  %  the  indecies  of  the  peaks 

for  n  =  1; length (p) 

g(n)  =  var(p(n));  %  amplitude  of  the  peaks 


end 

y  =  sum (q) /length (q) ; 
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AMPR.M 


function  y  =  ainpr(var) 

%  This  function  finds  the  average  of  the 
%  amplitudes  of  the  peaks  in  the  lower 
%  respiratory  signal  over  a  specified  period  of  tine. 

%  To  use  this  command  the  user  must 
%  enter  the  variable  name 
% 

%  eg.  ampr (variable  name) 

p  =  peaklr(var);  %  the  indecies  of  the  peaks 

for  n  =  1: length (p) 

q(n)  =  var(p(n));  %  amplitude  of  the  peaks 

end 

y  *=  sum (q) /length (q) ; 
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CURVE. M 


function  y  =  curve (var) 

%  This  function  finds  the  length  of  the  variable 
% 

%  To  use  this  command  the  user  must  enter  the 
%  variable  name  and  the  start  and  finish  points 
%  of  the  signal  to  be  displayed 
% 

%  eg.  curve (variable  name) 


X  =  sqrt (diff (var) . "2  +  1) ; 
y  =  sum (X) ; 
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lE.M 


function  y  =  ie(var) 

%  This  function  takes  the  i/e  ratio  of  the  respiratory  signals, 
% 

%  To  use  this  command  the  user  must  enter  the  variable  name 
% 

%  eg.  ie (variable  name) 

p  =  peaklr(var);  %  finds  the  indices  of 

%  the  peaks  in  a  signal  and  puts  them 
%  in  a  vector  a 

plength  =  length (p); 

V  =  valleylr (var) ;  %  finds  the  indices  of  the 

%  valleys  in  a  signal  and  puts  them 
%  in  a  vector  b 

vlength  =  length (v); 

if  vlength  <  2  |  plength  <2  %  check  that  enough  peaks 

%  and  valleys  exist  for 
%  the  calculation  to  be  done 

message  =  '  Warning  !!!!  Not  enough  data’ 

end 

if  p(l)  >  v(l) 

for  n  =  1: vlength  -  1 
q  =  p(n)  -  v(n) y 


z  =  v(n  +  1)  -  p(n) ; 
e(n)  =  q  ./  z; 

end 

end 

if  p(l)  <  v(l) 

for  n  =  1; vlength  -  1 

q  =  p(n  +  1)  -  v(n);  %  calculates  a  vector  of 

%  e/i  ratios  for  the  peaks 
%  and  valleys  in  the 
%  given  time  period 

z  =  v(n  +  1)  -  p(n  +  1); 


%  calculates  a  vector  of 
%  e/i  ratios  for  the  given 
%  time  period 
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lE.M 


e(n)  =  q  ./  z; 

end 

end 

y  =  niean(e) ; 
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lEIE.M 


function  y  =  ieie(varl,var2) 

%  This  function  takes  the  i/e  ratio  of  the  respiratory  signals 
%  before  and  after  a  question  is  asked.  It  then  divides  the  two 
%  values. 

% 

%  To  use  this  command  the  user  must  enter  the  variable  name 
% 

%  eg.  ieie (variable  namel,  variable  name2) 
a  =  ie(varl) ; 
b  =  ie(var2) ; 
y  =  a/b; 
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PEAK.M 


function  y  =  peak(var) 

%  This  function  finds  the  peaks  in  a  signal  and  returns  the  index 
%  It  also  creates  a  plot  of  the  variable  with  the  peaks  marked 
% 

%  To  use  this  command  the  user  must  enter  the  variable  name 
%  of  the  signal  to  be  displayed 
% 

%  eg.  peak (variable  name) 
q  =  diff (var) ; 
z  =  q>0; 
f  =  diff (2) ; 
a  =  f<0; 
y  =  find(a) ; 


%  differentiates  the  variable 
%  z  =  1  if  q  is  greater  than  0 
%  2nd  derivative  of  the  variable 

%  finds  the  indices  where  the  2nd  derivative 
%  is  -1  which  indicates  peak 
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PEAKCARD.M 


function  y  =  peakcard(var) 

%  This  function  finds  the  peaks  in 
%  the  cardio  signal  and  returns  a  vector  of 
%  indexes  where  they  occur. 

% 

%  To  use  this  command  the  user  must  enter  the  variable  name 
% 

%  eg.  peakcard (variable  name) 


ty  =  peak(var); 

if  ty(l)  <  8 

ty  =  ty (2: length (ty) ) ; 


end 


if  ty (length (ty) )  >  length(var)  -  8 
ty  =  ty(l: length (ty) -1) ; 

end 


for  n  =  1: length (ty) ; 
pan 


%  finds  the  maxim\im  peak  over  a  10  point 


temp  =  var(ty(n)-8  :  ty(n)+8); 

z(n)  =  ty(n)  -  9  +  find(temp  ==  max(temp)); 

%  finds  the  time  that  the  peak 
%  occurs  in  the  original  signal 

end 

for  n  =  1: length (2) -1  %  elliminates  duplicate  indicies 

if  z(n)  ==  z(n+l) 
z(n)  =  0; 

end 

end 

ind  =  find(z);  %  finds  indecies  of  elements 

%  that  are  not  equal  to  zero 

for  n  =  1: length (ind)  %  elliminates  0  elements 

z (n)  =  z (ind(n) ) ; 

end 


Page  1 


PEAKCARD.M 


y  =  z (1: length (ind) ) ; 

%  pmark  =  z er os (1, length ( var) ) ; 
r 

%  pmark (y)  =  ones (1, length(y) ) ; 
%  plot (var, 'r ' ) 

%  title ('Ir  marked  with  peaks’) 
%  hold  on 

%  plot (5*pmark, 'g' ) 

%  hold  off 


%  a  vector  of  I's  where  peaks  occu 
%  O's  everywhere  else 
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PEAKLR.M 


function  y  =  pea]clr(var) 

%  This  function  finds  the  peaks 
%  in  the  Ir  signal  and  returns  a  vector 
%  of  indecies  where  they  occur. 

% 

%  To  use  this  command  the  user  must  enter  the  variable  name 
% 

%  eg.  peaklr (variable  name) 


tb,a]  =  butter (4, .034) ;  %  elliminate  noise 

filtout  =  filtfilt (b,a,var) ; 

ty  =  peak(f iltout) ;  %  finds  the  time  that  the 

%  peaks  of  filtered  Ir  signal  occur 


if  ty(l)  <  20 

ty  =  ty (2 : length (ty) ) ; 

end 

if  ty (length (ty) )  >  length (var)  -  20 
ty  =  ty (l: length (ty) -1) ; 

end 


for  n  =  1 : length (ty) 

temp  =  var (ty (n) -20;ty(n)+20) ; 

z(n)  =  ty(n)  -  21  +  find(temp  “  max(temp)); 

%  finds  the  time  that  the  peak  occurs  in 
%  the  original  signal 

end 

for  n  =  1: length(2) -1  %  elliminates  duplicate  indicies 

if  z(n)  ==  z(n+l) 
z(n)  =  0; 

end 

end 

ind  =  find (2);  %  finds  indecies  of  elements 

%  that  are  not  equal  to  zero 

for  n  =  l:length(ind)  %  elliminates  0  elements 

z (n)  =  z (ind(n) ) ; 


end 
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PEAKLR.M 


(1; length (ind) ) ; 


I 
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PEAKNXJMC.M 


function  y  =  peaknuiac(var) 

%  This  function  finds  the  nu^er  of 
%  peaks  in  the  high  cardio  signal 
% 

%  To  use  this  command  the  user 
%  must  enter  the  variable  name 
% 

%  eg.  peakn\imc (variable  name) 

p  =  peakcard (var) ;  %  the  indecies  of  the  peaks 

y  =  length (p); 
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PEAKNUMR.M 


function  y  =  peaknumr (var) 

%  This  function  finds  the  number 
%  of  peaks  in  the  respiratory  signal 
% 

%  To  use  this  command  the  user 
%  must  enter  the  variable  name 
% 

%  eg.  peaknumr (variable  name) 

p  =  peaklr(var);  %  the  indecies  of  the  peaks 

y  *  length (p) ; 
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TSTFEAT.M 


feature_list=[  '  lOinean(frag) 

' lOcurve (frag)  ' ; 

'lOarea(frag) 

'20jnean(frag) 

'20curve(frag) 

'20area(frag) 

'20ampcard(frag) 

' 2  Opeaknumc ( frag)  ' ; 

' 30roean(frag) 

'30curve(frag) 

'30area(frag) 

'40mean(frag) 

' 40curve (frag) 

'40area(frag) 

'  50inean(frag) 

' SOcurve (frag)  ' ; 

'50area(frag) 

' SOampr (frag) 

' SOpeaknumr ( frag)  ' ; 

'50ie(frag) 

'50ieie(frag,  frag2) 

'  60iiiean(frag) 

'60curve(frag) 

' 60area (frag) 

' eoampr (frag) 

' SOpeaknumr (frag)  ' ; 

'60ie(frag) 

' 60ieie(frag,  frag2)  ']; 

[X  y  z]  =  featurev(t79, [1  2], [3  4], [6  10] , feature_list) 
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VALCARD.M 


function  y  =  valcard(var, start, finish) 

%  This  function  finds  the  valleys  in 

%  the  Ir  signal  and  returns  a  vector  of  indexes  where 
%  they  occur 
% 

%  To  use  this  command  the  user  must  enter  the 
%  file  name  and  the  start  and  finish  points 
%  of  the  signal  to  be  displayed 
% 

%  eg.  valcard(file  name,  start,  finish) 


k  =  hicardio (var, start, finish) ; 


[b,a]  =  butter (4, .034) ;  %  elliminate  high  frequencies 

filtout  =  k;  %  filtfilt(b,a,k) ; 

tv  =  valley (filtout, start, finish)  %  finds  the  time  that  the 

%  peaks  of  filtered  Ir  signal  oc 

cur 

1  =  length (ty); 


for  n  =  1:1 

temp  =  k(max(l,ty(n)-'10+start)  :  min(ty(n)+10+start, length (k) 


if  ty(n)<10 

dd= length (temp) /2+1; 
else 

dd=ll; 

end 

v(n)  =  ty(n)  -  dd  +  find(temp  *=  min(temp)); 

%  finds  the  time  that  the  peak  occurs  in 
%  the  original  signal 

end 

vmark  =  zeros (1, finish  -  start);  %  a  vector  of  I's  where  peaks  occ 
ur 

%  O's  everywhere  else 

vmark (y)  =  ones (1, length (y) ) ; 
subplot  (211) ,  plot  (k(  start ‘.finish) ,  'r' ) 
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VALCARD.M 


title ('Ir  marked  with  peaks*) 
hold  on 

plot (“5*vmark, 'g') 
hold  off 

subplot(212) ,plot (filtout(start:finish) , 
title (' filtered  Ir  marked  with  peaks') 
hold  on 

plot (vmark, 'g* ) 
hold  off 

%  subplot (223) , plot (k (start: finish) , 'r') 
%  hold  on 

%  plot (5*a(l: finish  -  start  -  3),'g') 

%  hold  off 

%  subplot (224) , plot (x) 

%  subplot (111) 
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VALLEY. M 


function  y  =  valley (var) 

%  This  function  finds  the 

%  valleys  in  a  signal  and  returns  the  index 

%  To  use  this  command  the  user 
%  must  enter  the  variable  name 
% 

%  eg.  valley (variable  name) 

q  =  diff(var);  %  differentiates  the  variable 

2=q>0;  %z=lifqis  greater  than  0 

f  =  diff(2);  %  2nd  derivative  of  variable 

a  =  f  >  0;  %  finds  valleys 

y  =  find (a) ;  %  finds  the  indices  where  the  2nd  derivative 

%  is  +1  which  indicates  valleys 
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VALLEYLR.M 


function  y  =  valley Ir(var) 

%  This  function  finds  the  valleys  in 
%  the  Ir  signal  and  returns  a  vector  of 
%  indecies  where  they  occur 
% 

%  To  use  this  command  the  user  must  enter  the  variable  name 
% 

%  eg.  valley Ir (variable  name) 


[b,a]  =  butter (4, .034) ;  %  elliminate  high  frequencies 

filtout  =  f iltfilt(b,a,var) ; 

ty  =  valley (filtout) ;  %  finds  the  time  that  the 

%  peaks  of  filtered  Ir  signal  occur 

for  n  =  1 : length (ty) 

temp  =  var(max(l,ty(n)-20)  :  min (ty(n) +20, length (var) )) ; 

if  ty(n)<20 

dd= length (temp) /2+1; 
else 

dd=21; 

end 

z(n)  =  ty(n)  -  dd  +  find(temp  ==  min(temp)); 

%  finds  the  time  that  the  peak  occurs  in 
%  the  original  signal 

end 

for  n  =  1: length (z)-l  %  elliminates  duplicate  indicies 

if  z(n)  ==  z(n+l) 
z(n)  =  0; 

end 

end 

ind  =  find(z);  %  finds  indecies  of  elements 

%  that  are  not  equal  to  zero 

for  n  =  1: length (ind)  %  elliminates  0  elements 

z (n)  =  z (ind(n) ) ; 


end 
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2 (1: length(ind) ) ; 
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0  Introduction 


The  polygraph  examination  is  one  of  the  most  popular  methods  to  measure  deception. 
Polygraph  tests  are  used  in  criminal  investigations  to  determine  if  a  suspect  is  being 
deceptive  when  answering  the  questions  concerning  a  crime.  During  a  polygraph  test,  the 
subject  is  asked  a  series  of  control,  relevant  and  irrelevant  questions  that  provide 
physiological  responses  for  comparison  with  question  that  are  relevant  to  the  investigation. 
The  three  physiological  responses  that  are  currently  measured  are  electrocardiogram, 
galvanic  skin  response  and  respiration.  The  controversy  surrounding  the  use  of  polygraph 
tests  centers  on  the  subjective  judgment  of  polygraph  examiners  in  classifying  the  subject  as 
deceptive  or  non-deceptive.  The  object  of  this  project  is  to  develop  an  automatic  scoring 
system  to  overcome  this  perception.  The  computer  algorithm  will  be  able  to  use  more 
sophisticated  techniques  than  human  examiners,  should  be  more  accurate  and  will  ensure 
consistency  from  case  to  case. 

In  order  to  implement  the  automatic  scoring  system,  two  main  algorithms  were  developed. 
These  were;  the  feature  extraction  algorithm,  which  process  the  polygraph  data  in  three 
time,  correlation  and  frequency  domains,  and  the  fuzzy  classifier  algorithrn,  which  accepts 
the  features  and  determines  the  possibility  of  deception.  Because  of  the  nature  of  the  input, 
fuzzy  logic  was  chosen  to  implement  the  system  which  gives  the  possibility  of  belonging  of 
an  input  to  each  class.  Initially,  a  set  of  features  based  on  physiologic^  reactions  were 
selected.  Then,  the  fuzzy  K-nearest  neighbor  classifier  was  used  to  classify  the  features. 
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1  Polygraph 


1.1  Polygraph  Examination 

The  primary  use  of  the  polygraph  test  is  during  the  investigation  stage  of  the  criminal  justice 
process.  In  addition  to  the  sigrjficance  role  in  criminal  justice,  they  are  also  used  for 
national  security,  intelligence  and  counterintelligence  activities  [1],  The  three  physiological 
responses  currently  obtained  from  a  polygraph  examination  are  electrocardiogram, 
respiration  and  galvanic  skin  response.  Electrocardiogram  is  measured  by  placing  a  standard 
cuff  on  the  arm  over  the  brachial  artery.  Respiration  is  monitored  by  placing  rubber  tubes 
around  the  abdominal  area  of  the  subject.  Skin  conductivity  is  measured  by  electrodes 
placed  on  two  fingers  of  the  same  hand  of  the  subject  [1]. 

The  effectiveness  of  a  polygraph  examination  is  often  the  result  of  the  test  format  that  is 
used.  A  polygraph  test  format  is  an  ordered  combination  of  relevant  question  about  an 
issue,  control  questions  that  provide  physiological  responses  for  comparison  and  irrelevant 
questions  that  act  as  a  buffer  [1].  An  example  or  a  relevant  question  is, "  did  you  embezzle 
any  of  the  missing  $12000?"  The  corresponding  control  question  would  be  about  stealing; 
an  example  is,  "did  you  ever  steal  money  or  property  from  an  employer?"  The  example  of 
an  irrelevant  question  is, "  is  your  name  John?"  Irrelevant  questions  are  answered  truthfully 
and  are  not  stressful.  The  rational  for  scoring  these  tests  is  that  a  deceptive  subject  will  be 
more  threatened  by  the  relevant  question  than  by  the  control  question  while  a  non  deceptive 
subject  will  be  more  threatened  by  the  control  questions  than  the  relevant  question. 

Polygraph  charts  are  usually  analyzed  by  a  human  interpreter  for  evidence  of  truth  or 
deception.  A  control  question  polygraph  chart  usually  consists  of  3  sets  of  control  relevant 
question  pairs  separated  by  neutral  questions.  The  examiner  scores  the  charts  by  comparing 
each  relevant  question.  For  each  of  three  physiological  responses,  he  will  give  a  numeric^ 
score  ranging  from  -3  tO  +3,  depending  on  the  magnitude  of  the  difference.  He  then  adds  up 
scores  for  all  control  relevant  pairs.  If  the  score  is  below  threshold  value,  he  scores  the 
chart  as  deceptive  or  non  deceptive. 

Sometimes  the  examiner  can  not  make  a  clear  decision  and  must  score  the  chart  as 
inconclusive.  The  examiner's  decision  will  be  based  on  his  or  her  experience  and  traiiung. 
For  example,  a  change  in  the  polygraph  tracing  considered  by  one  examiner  as  a 
physiological  changes,  may  be  considered  by  another  as  an  artifact  of  the  recording  system. 
In  an  effort  to  eliminate  the  inconsistencies  involved  in  interpreting  polygraph  data, 
computer  algorithm  are  being  developed. 
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1.2  History’ 

The  first  attempt  to  use  a  scientific  instrument  in  an  effort  to  detect  deception  occurred 
around  1 895  [2],  That  was  the  year  that  Cesar  Lombroso  published  the  results  of  his 
experiments  in  which  a  hydrosphygmograph  was  used  to  measure  the  blood  pressure-pulse 
changes  of  criminals  in  order  to  determine  whether  or  not  they  were  deceptive.  Although 
the  hydrosphygmograph  was  originally  intended  to  be  used  for  medical  purposes, 
Lombroso  found  that  it  worked  well  for  lie  detection.  Lombroso  may  have  been  the  first 
to  use  a  peak  of  tension  test  format.  This  was  done  by  showing  a  suspect  a  series  of 
photographs  of  children,  one  being  the  victim  of  sexual  assault.  If  the  suspect  did  not 
react  more  to  the  victims  picture  than  the  pictures  of  the  other  children,  Lombroso 
concluded  that  the  suspect  did  not  know  what  the  victim  looked  like  and  therefore  was  not 
the  alleged  perpetrator. 

In  1914  Vittorio  Benussi  published  his  research  on  predicting  deception  by  measuring 
recorded  respiration  tracings  [3],  He  found  that  if  the  length  of  inspiration  were  divide  by 
the  length  of  expiration,  the  ratio  would  be  larger  after  lying  than  before  lying  and  also 
before  telling  the  truth  than  after  telling  the  truth.  In  1921  John  A.  Larson  constructed  an 
instrument  capable  of  simultaneously  recording  blood  pressure  pulse  and  respiration 
during  an  examination  [2][3].  Larson  reported  accurate  results  which  prompted  Leonarde 
Keeler  to  construct  a  better  version  of  this  instrument  in  1926  [2][3]. 

The  use  of  galvanic  skin  response  in  lie  detection  began  during  the  turn  of  the  century.  It's 
usefulness,  however,  did  not  become  evident  until  the  1930's  during  which  time  several 
articles  written  by  Father  Walter  G.  Summers  of  Fordham  University  in  New  York  [3]. 

In  these  articles  he  reports  over  90  criminal  cases  in  which  examination  using  the  galvanic 
skin  response  had  all  been  successful  and  confirmed  by  confession  or  supplementary 
evidence.  The  usefulness  of  the  galvanic  skin  response  prompted  Keeler  to  add  an 
galvanometer  to  his  polygraph.  At  the  time  of  Keelers  death  in  1949,  the  Keeler 
Polygraph  recorded  blood  pressure-pulse,  respiration,  and  galvanic  skin  response  [3], 


1.3  Modern  Test  Formats’ 

The  effectiveness  of  a  polygraph  examination  is  often  the  result  of  the  test  format  that  is 
used.  A  polygraph  test  format  consists  of  an  ordered  combination  of  relevant  questions 
about  an  issue,  control  questions  that  provide  a  physical  response  for  comparison,  and 
irrelevant  questions  that  also  provide  a  response  or  the  lack  of  a  response  for  comparison 
[1][3].  Three  general  types  of  test  formats  are  in  use  today.  These  are  Control  Question 
Tests,  Relevant-Irrelevant  Tests,  and  Concealed  Knowledge  Tests.  Each  of  the  general 
test  formats  may  have  a  number  of  more  specific  variations.  Each  test  consists  of  two  to 


’These  sections  were  exerpted  from  Jacobs  [10]. 
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five  charts  containing  a  prescribed  series  of  questions.  The  test  format  that  is  used  in  an 
examination  is  determined  by  the  test  objective  [2][3]. 

The  concealed  knowledge  test,  also  called  peak  of  tension  test,  is  used  when  facts  about  a 
crime  are  known  only  by  the  investigators  and  not  by  the  public.  In  this  case,  a  subject 
would  not  know  the  facts  unless  he  or  she  was  guilty  of  the  crime.  For  example,  if  a  gun 
was  used  in  a  crime  and  the  public  did  not  know  the  caliber,  an  examiner  could  ask  a 
suspect  if  it  was  a  22  caliber,  a  38  caliber,  or  a  9  mm.  If  the  gun  used  was  a  9  mm  and  the 
suspect  was  deceptive,  a  polygraph  chart  would  probably  indicate  evidence  of  deception. 

A  control  question  test  is  often  used  in  criminal  investigations.  Relevant-Irrelevant  tests 
are  usually  used  to  test  people  trying  to  obtain  security  clearance  or  get  a  job.  In  this  test, 
relevant  questions  are  compared  to  irrelevant  questions.  Very  few  control  questions  are 
asked.  The  purpose  of  control  questions  in  this  test  is  to  make  sure  that  the  subject  is 
capable  of  reacting  at  all. 


1.4  Present  Day  Equipment^ 

The  most  popular  polygraph  machines  today  are  the  Reid  Polygraph  developed  in  1945 
and  the  Axciton  Systems  computerized  polygraph  developed  in  1989  [1][4].  The  Reid 
polygraph  scrolls  a  piece  of  paper  under  pens  that  record  the  biologic^  signals.  The 
Axciton  polygraph  digitizes  physiological  signals  and  uses  a  computer  to  process  them. 
The  sampling  frequency  of  the  Axciton  machine  is  30  Hz.  Axciton  provides  a  computer 
based  system  for  ranking  the  subject  responses  but  allows  printouts  of  the  charts  to  be 
scored  by  hand  the  traditional  way. 

Both  machines  record  the  same  biological  signals  using  standard  methods.  Blood  pressure 
is  measured  by  placing  a  standard  blood  pressure  cuff  on  the  arm  over  the  brachial  artery. 
Respiration  is  monitored  by  placing  rubber  tubes  around  the  abdominal  area  and  the  chest 
of  the  subject.  This  results  in  two  signals,  an  upper  and  lower  respiratory  signal.  Skin 
conductivity  is  measured  by  placing  electrodes  on  two  fingers  of  the  same  hand. 


^This  section  was  exerpted  from  Jacobs  [10]. 
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2  Classifier  Algorithm 

2.1  K-Nearest  Neighbor  Algorithm^ 

K-nearest  neighbor  algorithm  is  a  supervised  classification  method.  There  is  no  need  for 
the  training  or  adjusting  the  classifier.  A  set  of  labeled  input  samples  is  given  to  the 
classifier.  When  a  new  sample  is  given  to  the  system,  it  fods  its  K  nearest  neighboring 
samples,  and  assigns  this  sample  to  the  class  that  the  majority  of  the  neighbors  belong  to. 
K  could  be  any  positive  integer.  When  K  is  set  to  1,  the  algorithm  is  called  the  nearest 
neighbor  algorithm.  In  this  case  each  new  sample  is  assigned  to  the  class  of  its  nearest 
neighbor.  If  K  is  greater  than  1,  it  is  possible  that  there  is  no  majority  class.  To  remove 
this  tie,  the  sum  of  the  distances  of  the  new  sample  to  its  neighbors  in  each  class  is 
computed  and  the  sample  is  assigned  to  the  class  that  has  the  minimum  distance.  The 
main  advantage  of  using  this  method  is  that  the  samples  of  each  class  are  not  needed  to 
cluster  in  a  pre  specified  shape.  For  example,  for  a  two  class  classification,  the  K-nearest 
neighbor  classifier  can  still  give  very  good  results  if  the  samples  of  each  class  are  clustered 
in  two  distinct  points  in  the  space.  The  algorithm  for  the  K  nearest  neighbor  is  shown  in 
flow  chart  1 .  It  is  supposed  that  C  is  the  number  of  classes,  K  is  the  number  of  neighbors 
in  KNN,  AT.  jfj  is  the  '\th  labeled  sample  and  y  is  the  input  to  be  classified. 


^This  section  was  exerpted  from  Layeghi  [11]. 
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Flow  chart  1.  Fuzzy  K  Nearest  Neighbor  Algorithm 


The  fuzzy  K  nearest  neighbor  algorithm  uses  the  same  idea  of  conventional  K  nearest 
neighbor  algorithm,  that  is  finding  the  K  samples  that  are  closest  to  sample  to  be  classified. 
But  there  is  a  conceptual  difference  in  classification.  When  fuzzy  classification  is  used,  the 
input  is  not  assigned  to  a  single  class.  Instead,  the  degree  of  belongings  of  the  input  to 
each  class  is  determined  by  the  classifier.  By  using  this  method  more  information  is 
obtained  about  the  input.  For  example  if  the  result  of  classification  determines 
membership  of  an  input  to  class  A  is  0.9  and  to  class  B  is  0.1,  it  means  the  input  belongs 
to  class  A  with  a  very  good  possibility.  But  if  the  membership  to  class  A  is  0.55  and  to 
class  B  is  0.45,  it  means  that  we  cannot  be  very  sure  about  the  classification  of  the  input. 

If  the  crisp  classifier  is  used,  in  both  cases  the  input  will  be  assigned  to  class  A  and  no 
further  information  is  obtained. 

Refer  to  [5]  [6]  for  more  detailed  discussions  about  fuzzy  K  nearest  neighbor  algorithms. 
The  flowchart  for  a  fuzzy  K  nearest  neighbor  classifier  is  drawn  in  flow  chart  2. 

The  first  step  in  the  fuzzy  K  nearest  neighbor  algorithm  is  the  same  as  first  step  in  crisp 
classifier.  In  both  cases  K  nearest  neighbors  of  the  input  are  found.  While  in  crisp 
classifier  the  majority  class  of  the  neighbors  is  assigned  to  the  input,  in  Fuzzy  classifier 
membership  of  the  input  to  each  class  should  be  found.  In  order  to  do  so  the  membership 
vector  of  each  sample  is  combined  to  obtain  the  membership  vector  of  the  input.  If  the 
samples  are  crisply  classified,  membership  vectors  should  be  assigned  to  them.  One 
method  to  do  so  is  to  assign  the  membership  of  1  to  the  class  that  it  belongs  to,  and 
membership  of  0  to  other  classes.  Other  methods  assign  different  memberships  to  the 
samples  according  to  its  distance  from  the  mean  of  the  class,  or  the  distances  from  the 
nearby  samples  of  its  own  class  and  the  other  classes. 

When  the  membership  vectors  of  the  labeled  samples  are  specified,  they  are  combined  to 
find  the  membership  vector  of  the  unknown  class.  This  procedure  should  be  done  in  a 
way  that  samples  that  are  closer  to  the  input  have  more  effect  on  the  resultant  membership 
function.  The  following  formula  uses  the  inverse  distance  to  weigh  the  membership 
functions,  x  is  the  input  to  be  classified,  is  the  j//j  nearest  neighbor  and  u^j  is  the 
membership  of  the  j//?  nearest  neighbor  of  the  input  in  class  i.  D(x,y)  is  a  distance  measure 
between  the  vectors  x  and  y  which  could  be  the  Euclidean  distance. 

u,(x)  =  ^ - 

'^(l/Dfx.x^)”-') 

;=i 

m  is  a  parameter  that  changes  the  weighing  effect  of  the  distance.  When  m  »  1,  all  the 
samples  will  have  the  same  weight.  When  m  approaches  1,  nearest  samples  have  much 
more  effect  on  the  membership  value  of  the  input. 


3  Frequency  and  correlation  Domain  Features 


3.1  Preview 

The  purpose  of  this  chapter  is  to  show  how  the  frequency  and  correlation  domain 
representations  of  polygraph  signals  can  be  used  effectively  in  polygraph  analysis.  The  first 
step  in  analysis  of  a  time  series  is  to  plot  the  data  and  to  obtain  simple  descriptive 
measures  of  the  main  properties  of  the  series.  For  some  series,  in  addition  to  features  such 
as  trend,  seasonal  effect  and  cyclic  changes,  more  sophisticated  features  such  as  mean, 
variance,  auto  correlation  and  frequency  content  will  be  required  to  provide  an  adequate 
analysis. 

Most  physical  processes,  including  polygraph  signals,  involve  a  random  element  in  their 
structures.  Currently,  human  examiners  score  polygraph  tests  by  analyzing  obvious 
features  in  the  time  domain.  It  is  presumed  that  processing  polygraph  signals  in  frequency 
and  correlation  domain  will  provide  features  which  are  discriminator  between  deceptive 
and  non-deceptive  subjects.  Before  finding  the  frequency  domain  features  the  trend  in  the 
electrocardiogram  channel  was  eliminated.  In  order  to  do  so,  a  high  frequency 
electrocardiogram  channel,  called  heart  pulse,  is  produced  by  highpass  filtering  it. 

The  goal  of  this  chapter  is  to  explain  the  techniques  used  to  extract  appropriate  features  in 
frequency  and  correlation  domains.  The  methods  for  estimating  features  of  the  polygraph 
signals  such  as  fundamental  frequency,  spectral  density  and  cross  correlation  between  the 
channels  will  be  discussed. 


3.2  Fundamental  Frequency 

One  feature  which  is  considered  important  in  the  frequency  domain  is  the  fundamental 
frequency  of  the  signal.  The  purpose  of  finding  the  fundamental  frequency  is  to  classify 
the  way  the  frequency  changes  in  a  specific  time  segment.  The  assumption  in  polygraph 
signals  is  that  the  frequency  of  the  signal  changes  after  a  relevant  or  a  control  question  is 
asked.  Different  methods  have  been  proposed  to  find  the  fundamental  frequency  of  a 
signal.  One  of  these  methods  is  using  the  auto  correlation  function. 

The  auto  correlation  representation  of  a  signal  is  a  convenient  way  of  displaying  certain 
properties  of  the  signal.  For  example,  the  auto  correlation  function  of  a  periodic  signal  is 
also  periodic  with  the  same  period.  For  periodic  signals  with  period  P,  the  auto 

correlation  function  attains  a  maximum  at  samples  0,±P  ,±2P .  Regardless  of  the  time 

origin  of  the  signal,  the  period  can  be  estimated  by  finding  the  location  of  the  first 
maximum  in  the  auto  correlation  function  [7]. 
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This  property  makes  the  auto  correlation  function  an  attractive  basis  for  estimating 
periodicity  in  most  signals  including  the  electrocardiogram  and  respiration  signals  of  the 
polygraph  records.  Therefore,  a  short  segment  of  the  signals  (electrocardiogram  and 
j  respiratory)  after  each  question  is  selected  and  pre-processed.  The  auto  correlation  is  then 

calculated  for  the  windowed  segments  of  the  heart  pulse  and  respiratory  signals  using 
I  MATLAB.  Figure  1  shows  the  examples  of  auto  correlation  functions  computed  for  heart 

I  pulse  with  N  =  1 50  and  upper  respiratory  with  N  =  400  sampled  at  30  Hz.  N  is  the 

number  of  samples . 

t 

i  It  is  noticeable  that  the  auto  correlation  functions  of  the  above  signals  are  a  mixture  of 

damped  exponential  and  sinusoids.  For  the  heart  pulse,  peaks  occur  approximately  at 
!  multiples  of  20  samples  indicating  a  period  of 20/30=0.67  seconds  or  a  fundamental 

frequency  of  approximately  1 .5  Hz.  For  the  upper  respiratory,  peaks  occur  approximately 
at  multiples  of  133  samples  indicating  a  period  of  133/30  =  4.4  seconds  or  a  fiindamental 
frequency  of  approximately  0.23  Hz. 


Figure  1.  Plots  of  auto  correlation  function  for  (a)  heart  pulse  and  (b)  upper  respiratory 
where  k  is  the  number  of  samples. 
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For  some  subjects,  the  period  of  the  electrocardiogram  or  upper  respiratory  signal  changes 
across  the  N  sample  interval.  Also,  the  shape  of  the  signal  varies  somewhat  from  period 
to  period.  Because  of  the  finite  length  of  segments  involved  in  the  computation  of  auto¬ 
correlation,  there  is  less  and  less  data  involved  in  the  computation  as  the  lag  increases. 

This  leads  to  the  reduction  in  amplitude  of  the  correlation  peaks  as  lag  increases. 

An  important  issue  is  how  N  should  be  chosen  to  give  a  good  indication  of  periodicity. 
Because  we  are  interested  in  observing  changes  in  signal  after  the  question  is  asked,  N 
should  be  small.  On  the  other  hand,  it  should  be  noted  that  to  get  any  indication  of 
periodicity  in  the  auto  correlation  function,  the  window  must  have  the  duration  of  at  least 
two  periods  of  the  waveform.  In  order  to  choose  the  best  N,  the  fundamental  frequency 
for  different  time  frames  without  overlap  were  calculated  and  the  results  were  examined. 
The  fundamental  frequencies  of  heart  pulse  for  the  four  second  frame  are  shown  in  Table  1 
and  2  in  Appendix  A.  No  single  value  of  N  is  entirely  satisfactory  because  the  frequency 
changes  from  individual  to  individual.  However,  a  suitable  practical  choice  for  N  was 
chosen  on  the  order  of  180  and  480  for  heart  pulse  and  upper  respiratory  respectively. 


3.3  Modeling 

Detailed  information  about  a  time  series  can  be  obtained  from  creating  a  model.  In  this 
section  a  model  will  be  found  for  the  heart  pulse  signal.  Finding  a  suitable  model  for  a 
given  time  series  depends  on  the  properties  of  the  series  and  the  number  of  observations 
available.  In  signal  modeling  the  output  signal  is  known  and  the  model  development  is 
based  upon  the  fact  that  signal  points  are  correlated.  Estimated  auto  correlation  function 
(ACF)  of  the  time  series  is  helpful  in  identifying  which  type  of  ARMA  model  is 
appropriate  and  gives  the  best  representation  of  the  signal. 

The  ACF  of  a  MA  process  cuts  off  at  lag  q  whereas  the  ACF  of  an  AR  process  is  a 
mixture  of  damped  exponential  and  sinusoids  and  dies  out  slowly.  For  example,  if  rl  is 
significantly  different  from  zero  but  the  subsequent  values  of  n  are  all  close  to  zero  then 
an  MA(1)  model  is  indicated  since  its  theoretical  ACF  is  of  this  form.  Alternatively,  if 
n,r:,r3 ,..  appear  to  be  decreasing  exponentially,  then  an  AR(1)  model  may  be 
appropriate. 

It  is  usually  difficult  to  find  the  order  of  an  AR  process  from  the  sample  ACF  alone.  A 
model  with  too  low  an  order  will  not  represent  the  properties  of  the  signal.  Also  a  model 
with  too  high  an  order  will  represent  any  measurement  noise  or  inaccuracies.  Therefore, 
neither  a  high  order  nor  a  low  order  model  will  be  a  reliable  representation  of  the  signal. 
As  a  result,  method  that  will  determine  the  model  order  should  be  used.  One  approach  is 
to  fit  AR  processes  of  progressively  higher  order,  to  calculate  the  squared  error  for  each 
value  of  model  order  (M),  and  to  plot  this  against  model  order.  It  may  then  be  possible  to 
see  the  value  of  M  where  the  curve  flattens  out  and  the  addition  of  extra  parameters  gives 
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little  improvement  in  fit.  Another  approach  based  upon  the  principals  of  prediction  is  that 
to  increase  the  model  order  until  the  residual  process  becomes  a  white  noise. 

Other  criteria  have  been  developed  that  are  based  upon  concepts  in  mathematical  statistics 
[9].  The  first  one  is  the  final  prediction  error  (FPE), 


FPE  =  P 


A^  +  A/  +  l 
N-M-\ 


(3.3a) 


Where  P,  N  and  M  are  error,  number  of  samples  and  model  order  respectively. 

The  fractional  portion  of  FPE  increases  with  M  and  accounts  for  the  inaccuracies  in 
estimating  the  parameters.  The  other  criterion  is  called  Akaike's  information  criterion 
(AIC).  It  is: 

AIC  =  A^lnp2  +  2P  (3.3b) 

The  first  criterion  tends  to  have  a  minimum  at  values  of  M  that  are  less  than  the  model 
order  and  the  second  one  tends  to  overestimate  model  order. 


The  above  criteria  were  calculated  for  electrocardiogram  signal  and  the  results  were 
plotted  in  Figure  2.  As  shown  in  Figure  2(a),  the  error  decreases  but  there  is  no  definitive 
slope  change.  The  largest  decrease  occurs  from  order  1  to  2  and  the  error  does  not  seem 
to  decrease  significantly  with  orders  greater  than  11.  For  FPE  (Figure  2(b))  and  AIC 
(Figure  2(c))  plots,  the  error  does  not  decrease  much  with  orders  greater  than  1 1 .  Thus, 
the  order  can  be  approximately  10.  The  Levinson-Durbin  algorithm  was  used  to  calculate 
the  AR  parameters  with  order  10  for  heart  pulse.  These  parameters  were  used  as  features. 


Figure  2.  The  different  criteria  for  heart  pulse  versus  model  order  (M):  (a)  error,  (b) 
FPE;  (c)AIC. 


3.4  Cross-covariance  and  cross-correlation  functions 

In  general,  it  may  be  necessary  to  study  the  interactions  between  two  processes  with 
possibly  different  scales  of  measurement  or  different  variances.  In  polygraph  where  time 
series  data  are  generated  from  more  than  one  channel  at  a  time,  features  like  cross¬ 
correlation  which  contain  information  about  relationships  between  the  channels  are 
extracted.  The  cross  covariance(Co )  and  cross  correlation  function  {rxy)  are  defined  as 
following: 


Cxy{k  )  — 


*=i 


N 


r^  =  Cxy/4CxxiO)Cyym 

,  ^Xin)  ^Yin) 

where  Wx  =  ^ — —  rriy  -  ^ 


[A:  =  0,l,....(Ar-l)]  (3.4a) 

(3.4b) 

(3.4c) 


A'  ' 

Cxr  (0)  and  C>y  (0)  are  the  variances  of  observations  on  X  and  Y  respectively. 
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This  estimate  is  asymptotically  unbiased.  However,  the  variance  of  the  estimate  depends 
on  the  auto  correlation  functions  of  the  two  components.  Therefore,  for  moderately  large 
values  of  N  it  is  possible  for  two  series,  which  are  actually  uncorrelated,  to  give  rise  to 
large  cross-correlation  coelSicients  which  are  actually  spurious.  Thus,  both  series  should 
first  be  filtered  to  convert  them  to  white  noise  before  computing  the  cross-correlation 
function  [8]. 

In  order  to  determine  the  relationship  between  the  upper  respiratory  and  heart  rate,  the 
cross  correlation  between  them  was  calculated.  Figure  3  shows  the  cross  correlation 
between  heart  pulse  and  upper  respiratory  for  a  control  and  a  relevant  question  for  two 
different  deceptive  and  non  deceptive  cases. 


Figure  3.  Cross  correlation  between  upper  respiratory  and  heart  pulse  before 
modeling,  (a)  and  (b)  90  seconds  after  relevant  question  5.  (b)  and 
(c)  90  seconds  after  control  question  6. 
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3.5  Whitening  filter 

For  a  given  process  {x(n)},  the  innovation  process  {v(n)}  is  defined  as  a  white  noise 
process  such  that  {v(n))  can  be  determined  from  the  signal  {x(n)}  by  the  whitening  filter. 
The  innovations  representation  of  a  random  process  is  a  powerful  analytic  tool.  The 
innovation  process  makes  the  interpretation  of  the  original  process  simpler  than  the 
original  signal.  Yet  both  processes  contain  the  same  statistical  information.  In  other 
words,  there  is  no  loss  of  information  as  a  result  of  the  transformation. 

As  stated  in  section  3.4,  it  is  possible  for  two  series,  which  are  actually  uncorrelated,  to 
give  rise  to  large  cross-correlation  coefficients  which  are  actually  spurious.  Thus,  the 
series  should  first  be  filtered  to  convert  them  to  white  noise  before  computing  the  cross¬ 
correlation  function.  The  AR  parameters  were  used  to  design  the  whitening  filter .  Then, 
the  heart  pulse  signal  was  filtered  to  convert  it  to  white  noise. 

When  the  time  series  is  white  noise  and  purely  random,  the  neighboring  points  of  the  ACF 
are  uncorrelated.  In  order  to  compare  the  whitening  filter  output  and  the  theoretical  white 
noise,  both  the  output  of  the  whitening  filter  and  its  auto  correlation  for  electrocardiogram 
were  plotted  in  Figure  4.  It  is  seen  that  the  auto  correlation  shows  high  correlation  for  lag 
zero  (k=175)  and  small  correlation  for  other  lags  as  it  expected. 


Figure  4.  Plots  of  (a)  white  noise  (output  of  the  whitening  filter);  (b)  auto  correlation 
of  the  white  noise. 
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The  heart  pulse  and  its  innovation  process  (pre  whitening  filter  output)  contain  the  same 
information.  The  results  of  cross-correlation  between  upper  respiratory  and  heart  rate 
signals  after  pre  whitening  are  shown  in  figure  5.  It  can  be  seen  that  the  cross-correlation 
after  modeling  is  similar  to  the  cross  correlation  before  modeling  (Figure  2)  with  less 
spurious  peaks.  The  maximum  and  minimum  value  of  cross  correlation  and  their  lags 
were  considered  as  potential  features  in  correlation  domain.  As  presented  in  figure  5  (b), 
heart  pulse  and  upper  respiratory  channels  are  positively  correlated  after  the  30  to  90  lags 
(1-3  seconds)  and  are  negatively  correlated  after  130  lags  (4.3  seconds). 


Figure  5.  Cross  correlation  between  heart  pulse  and  upper  respiratory  after  modeling  for 
(a)  and  (b)  90  seconds  after  relevant  question  5.  (b)  and  (c)  90  seconds 
after  control  question  6. 
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3.6  Spectral  Analysis 

In  this  section  the  frequency  properties  of  the  polygraph  signals  such  as  power  spectrum 
and  cross  spectral  density  are  analyzed.  The  cross-correlation  and  cross  spectral  density 
are  the  tools  for  examining  the  relationships  between  two  signals  in  the  time  and  frequency 
domains  respectively.  The  power  spectrum  shows  how  the  variance  of  the  signal  is 
distributed  with  frequency.  The  total  area  underneath  the  spectrum  curve  is  equal  to  the 
variance  of  the  sign^.  A  peak  in  the  spectrum  indicates  an  important  contribution  to  the 
variance  at  different  frequencies. 

The  estimated  spectrum  for  different  channels  were  plotted  on  linear  scale  in  Figure  6  and 
on  logarithmic  scale  in  Figure  7.  For  spectrum  showing  large  variations  in  power,  a 
logarithmic  scale  makes  it  possible  to  show  more  detail  over  a  wide  range.  However,  this 
exaggerates  the  visual  effects  of  variations  where  the  spectrum  is  small.  It  is  often  easier 
to  interpret  the  spectrum  plotted  on  a  linear  scale  than  logarithmic  scale. 


Figure  6.  Frequency  contents  of  four  polygraph  signals  on  linear  scale,  (a)  GSR  for 
480  samples,  (b)  heart  pulse  for  200  samples,  (c)  and  (d)  lower  and  upper 
respiratory  for  480  samples. 
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Figure  7.  Frequency  contents  of  four  polygraph  signals  on  logarithmic  scale,  (a)  GSR 
for  480  samples,  (b)  heart  pulse  for  200  samples,  (c)  and  (d)  lower  and  upper 
respiratory  for  480  samples. 


Figure  7  shows  for  GSR  the  variance  is  concentrated  at  low  frequencies  indicating  a  trend 
or  non-stationary  behavior.  The  spectrum  for  heart  pulse  signal  shows  the  presence  of 
harmonics  with  a  large  peak  at  fundamental  frequency  of  f  =  2  Hz  and  related  peaks  at 
2f,  3f,  ....These  multiples  of  the  fundamental  indicate  the  non  sinusoidal  character  of  the 
main  cyclical  component. 

The  correlation  between  two  signals  can  be  described  in  the  frequency  domain  by  their 
cross  amplitude,  phase  spectra  or  the  squared  coherency.  The  coherency  measures  the 
linear  correlation  between  the  two  components  of  the  two  channels  at  frequency  f .  The 
closer  the  coherency  is  to  one,  the  more  closely  related  are  the  two  signals  at  frequency  f 

The  MATLAB  function  spectrum.m  finds  the  cross-spectrum  and  coherency  between 
upper  respiratory  and  electrocardiogram  and  are  shown  in  Figure  8.  Their  cross  spectrum 
shows  a  large  peak  at  f  =  2  Hz.  Maximum  cross  spectral  density  and  the  magnitude  of 
cross  spectral  density  and  coherency  at  fundamental  frequency  and  the  second  harmonic 
were  considered  as  features  in  frequency  domain. 
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Figure  8.  Plots  of  coherency  and  cross  spectral  density  between  heart  pulse  and 
upper  respiratory  signals. 


3.7  Integrated  spectral  distance 

This  section  describes  how  to  obtain  a  feature  in  the  frequency  domain  called  integrated 
spectral  difference.  This  feature  was  introduced  by  Martin  and  Pounds  [12].  Other 
features  are  calculated  separately  for  each  control,  relevant  and  irrelevant  questions.  The 
integrated  spectral  distance  is  calculated  in  a  different  way  than  the  other  features.  This 
feature  is  calculated  by  taking  the  difference  between  the  cumulative  values  of  the  power 
spectral  density  for  each  relevant  and  its  closest  control  question.  The  integrated  spectral 
distance  measures  the  distance  between  a  control  and  a  relevant  question  directly.  Figure 
9  shows  the  cumulative  spectral  density  for  a  control  and  a  relevant  question.  The 
maximum,  the  frequency  where  this  maximum  happens  and  the  area  underneath  were 
considered  as  features. 
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Figure  9.  Cumulative  integrated  spectral  density  for  a  control  question  and 
relevant  question  of  the  heart  pulse  signal. 
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3.8  Frequency  and  Correlation  Domain  Features 


Table  1  summarizes  the  frequency  and  correlation  features  explained  in  the  above  sections 


Feature 

Channel 

Maximum  cross  correlation 

between  2  &  6 

Lag  of  maximum  cross  correlation 

between  2  &  6 

Minimum  cross  correlation 

between  2  &  6 

Lag  of  minimum  cross  correlation 

between  2  &  6 

Spectral  value  at  fundamental  frequency 

2 

Spectral  value  at  fundamental  frequency 

6 

Spectral  value  at  (fundamental  frequency  of  channel  2)  *2 

2 

Spectral  value  at  (fundamental  frequency  of  channel  6)  *2 

6 

Maximum  cross  spectral  density 

between  2  &  6 

Coherency  at  fundamental  frequency  of  channel  2 

between  2  &  6 

Coherency  (at  fundamental  frequency  of  channel  2)’'‘2 

between  2  &  6 

Fundamental  frequency 

2 

Fundamental  frequency 

5 

Maximum  or  minimum  integrated  spectral  difference 

1 

Frequency  of  the  maximum  integrated  spectral  difference 

1 

Area  underneath  integrated  spectral  difference 

1 

maximum  or  minimum  integrated  spectral  difference 

2 

Frequency  of  the  maximum  integrated  spectral  difference 

2 

Area  underneath  integrated  spectral  difference 

2 

Autoregressive  parameter 

2 

Table  1.  Frequency  and  correlation  domain  features. 
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4  Feature  extraction 


4.1  Preprocessing 

This  chapter  explains  the  steps  taken  in  feature  extraction  algorithm.  In  polygraph  tests, 
four  physiological  responses  are  measured.  These  responses  are;  upper  respiratory,  lower 
respiratory,  galvanic  skin  response  (GSR)  and  electrocardiogram.  These  four  polygraph 
responses  are  processed  into  six  channels.  A  low  frequency  electrocardiogram  channel  is 
produced  by  lowpass  filtering  the  electrocardiogram  channel.  A  high  frequency 
electrocardiogram  channel  is  produced  by  highpass  filtering  it.  The  high  frequency 
electrocardiogram,  called  heart  pulse,  the  low  frequency  electrocardiogram,  called  blood 
volume  and  derivative  of  the  low  frequency  electrocardiogram  are  used  instead  of  one 
electrocardiogram  channel.  To  eliminate  the  noise  and  any  trend,  all  the  signals  are 
filtered  and  detrended.  For  more  information  about  the  filtering  and  detrending  refer  to 
Jacobs  [10]. 
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4.2  Feature  Selection 


Many  of  the  time  domain  features  were  selected  based  on  the  examiners'  suggestions. 
However,  many  of  the  standard  statistical  features  were  also  considered  as  potential 
features.  For  more  information  about  time  domain  features  refer  to  Jacobs  [10].  The 
selected  features  and  the  channels  which  they  were  extracted  from  are  listed  below. 


Features 

Channel 

l)Mean 

1,2, 3, 4,5, 6 

2)  Standard  deviation 

1,2, 3, 4,5, 6 

3)  Minimum 

1,2, 3, 4,5, 6 

4)  Maximum 

1,2, 3, 4,5, 6 

5)  Curve  length 

1,2, 3,4, 5. 6 

6)  Mean  of  derivative 

1,2, 3, 4,  5, 6 

7)  Median  of  derivative 

1,2, 3, 4, 5, 6 

8)  Average  amplitude  of  peaks 

2.5,6 

9)  Minimum  amplitude  of  peaks 

2.5,6 

10)  Derivative  of  amplitudes  of  peaks 

2.5,6 

11)  Number  of  peaks 

2,5.6 

12)  Minimum  subtracted  from  maximum 

1,2, 3, 4, 5, 6 

13)  Inhalation/exhalation 

5,6 

14)  ratio  of  inhalation/exhalation  before 

5.6 

and  after  a  question  is  asked 

15)  Fundamental  frequenc)' 

2,  5 

16)  Maximum  cross  correlation 

between  2  and  6 

17)  Lag  of  maximum  cross  correlation 

between  2  and  6 

18)  Minimum  cross  correlation 

between  2  and  6 

19)  Lag  of  minimum  cross  correlation 

between  2  and  6 

20)  Spectral  value  at  fundamental  frequency* 

between  2  and  6 

21)  Spectral  value  at  second  harmonic 

between  2  and  6 

22)  Maximum  cross  spectral  density 

between  2  and  6 

23)  Coherencj-  at  fundamental  frequency 

between  2  and  6 

24)  Coherency  at  second  harmonic 

between  2  and  6 

25)  Autoregressive  parameters(AR) 

2 

26)  Maximum  or  minimum 

1.2 

integrated  spectral  difference  (ISD) 

27)  Frequency  of  maximum  ISD 

1,2 

28)  Area  under  ISD 

1, 2 
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4.3  Feature  Extraction  Algorithm 

AJl  features  are  extracted  for  10  relevant,  irrelevant  and  control  questions  except  features 
26,  27  and  28  that  are  extracted  for  each  relevant  and  its  closest  control  question.  The 
program  called  fextract.m  extracts  all  the  basic  features  for  each  question  on  each  chart 
for  about  18  non-deceptive  and  51  deceptive  cases.  Due  to  the  small  number  of  non- 
deceptive  cases,  each  chart  for  a  subject  was  used  as  a  separate  case.  By  doing  this  50 
non-deceptive  and  150  deceptive  files  were  created. 

The  test  format  used  in  this  project  is  MGQT  format.  It  is  a  type  of  control  question  test 
in  which  relevant,  irrelevant  and  control  questions  are  asked  in  a  specific  order.  Each 
polygraph  test  is  made  of  three  and  in  very  rare  cases  four  charts  for  each  case.  The 
order  in  which  the  questions  are  asked  is  changed  in  the  third  and  fourth  charts  and 
sometimes  in  the  second  chart.  The  feature  extraction  routine  needs  to  have  the  control, 
relevant  and  irrelevant  questions  labeled.  Therefore,  for  each  polygraph  chart  a 
complementary  chart  called  question  file  was  created  which  contains  a  matrix  called  Q. 

The  first  row  of  this  matrix  contains  the  relevant,  the  second  row  the  irrelevant  and  the 
third  row  the  control  questions  respectively. 

Fragments  of  each  signal  are  selected  before  features  are  extracted.  These  fragments  are 
shown  in  Table  2.  Start  and  end  points  given  in  the  table  refer  to  the  time  elapsed  after  the 
question  is  asked.  A  vector  of  features  for  each  file  is  created  by  the  program  feature.m 
which  is  called  by  fextract.m  program.  The  program  first  executes  all  of  the  processing 
routines  and  then  extracts  the  features  for  each  question  in  the  file.  The  features  are 
extracted  for  the  appropriate  time  segment  (see  Table  2)  of  six  channels  for  each 
polygraph  file.  The  time  segment  is  created  by  taking  a  sample  of  time  series  starting 
several  seconds  after  a  question  is  asked  and  continuing  for  a  number  of  seconds. 
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Channel  description 


Channel  Start 


End 


1 

Galvanic  Skin  conductivity(GSR) 

1 

2  sec. 

14  sec. 

I 

1 

i 

/ 

High  frequency  electrocardiogram 

2 

2  sec. 

9  sec. 

Low  frequency  electrocardiogram  (LC) 

3 

2  sec. 

18  sec. 

1 

\ 

\ 

Derivative  of  low  frequency 
electrocardiogram  (DLC) 

4 

0  sec. 

8  sec. 

! 

Lower  Respiratory  (LR) 

5 

2  sec. 

18  sec. 

Upper  Respiratory  (UR) 

6 

2  sec. 

18  sec. 

Table  2.  Time  fragment  used  in  feature  extraction 


The  feature  extraction  algorithm  provides  a  960  dimensional  vector  for  each  file.  The 
features  were  extracted  for  the  150  deceptive  and  50  non  deceptive  files  and  saved  in  a 
960  by  200  matrix  called  "  M".  In  order  to  classify  subjects  using  the  difference  between 
control  and  relevant  responses,  and  to  make  the  feature  vector  smaller,  the  features  were 
combined  according  to  the  following  method:  for  each  feature  /  except  features  26,  27,28 
from  each  subject  j  compute: 

1)  The  average  control  responses  .<4 vC// 

2)  The  average  relevant  responses  .(4v7?// 

3)  The  maximum  and  minimum  control  responses  MixC//  and  Af/nCy 

4)  The  maximum  and  minimum  relevant  responses  AToxT?//  zxi^MinRij 


The  feature  vector  components  for  feature  i  are  then: 
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l)Fij{2)- 

3) Fm  = 

4) Fm- 

5) FiJ{5)  = 

6) Fm-. 


l)Fij{l)^ 


AvRij  -  AvCij 
AvRij  -  AvCij 
AvRij  +  AvCij 

■  MaxRij  -  MaxCij 
-  MinRij  -  MinCij 
MaxRij  -  MinCij 

■  MinRij  -  MaxCij 
MaxRij 


MaxCij 


For  features  26,  27,  28  from  each  subject  j  compute; 

1)  The  average  of  relevant-control  responses 

2)  The  maximum  of  relevant-control  responses  Max(RC(ij) 

3)  The  minimum  of  relevant-control  responses 


The  feature  vector  components  for  feature  i  are  then: 


2) F,(2)  =  Max{RCi,)) 

3) Fji3)  =  Min{RCiij)) 

The  above  procedure  is  executed  by  program  called proces/.m  which  creates  a  669  by  200 
dimensional  matrix  called  "F".  In  order  to  run  the  classifier  program,  the  matrix  F  was 
divided  into  three  100  (50  deceptive  and  50  non-deceptive)  sets  of  matrices  called  setl, 
set2  and  set3.  These  sets  are  made  of  50  non-deceptive  cases  common  in  all  three  sets 
and  three  50  different  deceptive  sets,  called  deceptive  1,  deceptive  2  and  deceptive  3 
respectively.  The  list  of  the  files  used  in  the  setl,  set2  and  set3  are  shown  in  Table  3  in 
Appendix  A. 
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5  Results 


5.1  Frequency  Domain  Clustering 

Classifier  is  the  final  stage  in  a  pattern  recognition  system.  The  classifier  assigns  each 
input  to  one  of  the  classes.  The  classifier  could  be  designed  after  studying  the  distribution 
of  samples  in  each  class.  The  KNN  classifier  was  used  in  this  study  because  of  the 
following: 

1 )  The  uncertainty  about  the  shape  of  deceptive  and  non  deceptive  clusters  and 
their  sample  distributions. 

2)  The  possibility  that  the  samples  for  one  class  cluster  around  more  than  one  point 
in  space. 

The  frequency  domain  features  did  not  create  a  separate  distribution  of  samples  for 
deceptive  and  non  deceptive  classes.  However,  the  combination  of  frequency  and  time 
domain  features  resulted  in  more  distinct  clusters.  Figure  10  and  1 1  show  the  examples  of 
sample  distribution  (clustering)  for  non  deceptive  (x)  and  deceptive  (+)  classes. 


Figure  10.  Plot  of  maximum  of  GSR  versus  maximum  of  Upper  Respiratory. 
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Figure  11.  Plot  of  maximum  of  GSR  versus  frequency  of  maximum  integrated 
spectral  difference  of  GSR. 
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5.2  Discussion 


The  669  features  are  more  than  can  be  used  by  any  classification  techniques.  Thus,  the 
classification  program  and  the  scatter  measurement  program  were  run  for  each  feature  in 
each  set  individually.  The  results  of  the  first  experiment  were  examined  and  compared  to 
determine  the  features  which  were  the  best  discriminators  between  deceptive  and  non- 
deceptive  subjects.  After  comparing  the  results,  the  30  features  with  the  highest  accuracy 
rate  and  common  in  all  three  sets  were  selected.  These  best  features  were  listed  in 
Table  3. 

The  second  experiment  used  the  combination  of  two  features  out  of  the  best  30  features. 
The  results  for  the  best  30  features  were  examined  for  each  set  separately.  The  set3 
always  had  a  better  performance  than  the  other  two  sets.  However,  in  order  to  be 
consistent,  the  best  features  common  in  all  three  sets  were  selected  as  the  30  best  features. 
More  features  were  added  for  combination  of  three  and  four.  The  results  are  shown  in 
Table  4  and  5  in  Appendix  A. 

As  it  was  discussed  before,  the  classifier  was  used  to  compare  the  effectiveness  of  the 
single  features  and  to  choose  the  combination  of  the  best  features.  Changing  the  classifier 
parameters  such  as  K  might  change  the  results  of  the  classification.  However,  it  is  not 
practical  to  change  all  parameters  at  the  same  time.  Therefore,  the  classifier  was  used 
with  the  fixed  parameters  of  K=5  and  m=2.  After  selecting  the  final  feature  set,  theses 
parameters  were  changed  to  find  the  best  classification. 


No 

feature 

Description 

Channel 

Method 

1 

lOmean 

mean 

GSR 

1 

2 

lOcurve 

cuA'e  length 

GSR 

2 

3 

lOmed  dif 

median  of  the  derivative 

GSR 

1 

4 

lOmax  min 

minimum  subtracted  from  the  maximum 

GSR 

2 

5 

lOmax 

maximum  of  the  signal 

GSR 

1 

6 

lOmdif 

mean  of  derivative 

GSR 

3 

7 

20cur\’e 

curve  length 

Heart  pulse 

1 

8 

Heart  pulse 

1 

9 

20max  min 

minimum  subtracted  from  the  maximum 

Heart  pulse 

4 

10 

20max 

maximum  of  the  signal 

Heart  pulse 

4 

11 

20inin 

minimum  of  the  signal 

Heart  pulse 

1 

12 

30med  dif 

median  of  the  derivative 

Blood  pressure 

3 

13 

30max 

maximum  of  the  signal 

Blood  pressure 

1 

14 

40mean 

mean 

Derivative  of  Blood  pressure 

1 

15 

40max 

maximum  of  the  signal 

Derivative  of  Blood  pressure 

1 

16 

SOcun^e 

curve  length 

Lower  Respiratory 

6 

Lower  Respiratory' 

2 

18 

number  of  the  peaks 

Lower  Respiratory 

5 

19 

50ie 

inhalation  di\ided  by  exhalation 

Lower  Respiratory 

5 

20 

50ma\  min 

minimum  subtracted  from  the  maximum 

Lower  Respiratory 

2 

21 

50max 

maximum  of  the  signal 

6 

22 

60ma\  min 

minimum  subtracted  from  the  maximum 

2 

23 

60max 

maximum 

Upper  Respiratory 

3 

24 

lOstd 

standard  de\iation 

GSR 

2 

25 

20std 

standard  deviation 

Heart  pulse 

1 

26 

50std 

Standard  deviation 

Upper  Respiratory 

6 

27 

20annodl 

auto  regressive  parameter 

Heart  pulse 

7 

28 

26psdcohl 

max  cross  spearal  density 

Heart  pulse.  Lower 

Respiratoiy’ 

1 

29 

lOisdl 

frequencj’  of  maximum  integrated  spectral 
difference  of  control-relevant  pair 

GSR 

1* 

30 

20isdl 

area  under  integrated  spectral  difference 

Heart  pulse 

3* 

Methods:  l=DifFerence  of  Averages,  2=Normalized  Average,  3=Max-Max,  4=Min-Min, 

5=Max-Min,  6=Min-Max,  7=Max/Min ,  1*= Average  of  relevant-control  pairs,  3*=Max  of  relevant- 
control  pair. 

Table  3.  30  best  selected  Features 
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Conclusion 


The  classification  results  improved  consistently  by  increasing  the  number  of  features.  The 
best  features  are  {5  9  21  23}  and  (5  21  23  29}with  81  and  80  percent  correct 
classification  respectively.  These  features  are  maximum  of  GSR(5),  difference  between 
maximum  and  minimum  of  heart  pulse(9),  maximum  of  lower  respiratory(21),  maximum 
of  upper  respiratory(23)  and  frequency  of  maximum  integrated  spectral  difference  of 
control-relevant  pair  for  GSR(29). 

The  best  features  are  simple  and  obvious  features  such  as  maximum  and  minimum  of  the 
polygraph  signals.  In  other  words,  the  features  that  an  examiner  can  see  are  the  best 
discriminators  between  deceptive  and  non  deceptive. 

It  is  important  to  notice  that  the  best  features  are  the  combination  of  features  from  all  4 
different  GSR,  heart  pulse,  lower  and  upper  respiratory.  As  expected,  each  subject  shows 
reaction  to  different  channels.  Therefore,  the  combination  of  all  channels  is  the  best 
representative  of  deception. 

Another  point  to  notice  is  that  the  set3  has  better  classification  results  than  the  other  two 
sets.  For  example,  the  features  {9  14  19  24}  and  {5  21  23  29}show  87.4  and  86.6 
percent  correct  classification  for  set3.  The  data  in  set3  is  made  of  50  non  deceptive 
common  in  all  three  sets  and  50  deceptive  cases.  This  set  of  deceptive  cases,  called 
deceptive  3,  are  the  Acxiton  files  listed  in  Table  3  in  Appendix  A.  It  is  possible  that  there 
is  some  characteristic  in  these  deceptive  files  that  results  in  better  classification. 

As  stated  before,  due  to  the  small  number  of  non-deceptive  cases  available,  each  chart  for 
a  subject  was  used  as  a  separate  case.  After  classifying  the  charts,  the  charts  for  each  case 
were  combined  in  a  way  that  each  case  was  assigned  to  the  class  that  the  majority  of  the 
charts  belong  to.  Using  this  method,  the  classification  results  improved  from  81  percent 
to  85.6  percent  for  setl  and  set2  and  from  87  percent  to  91  percent  for  set3.  The  final 
result  is  included  in  appendix  A. 
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FILE  NAME 


FUNDAMENTAL  FREQUENCY  (Hz) 
CHANNEL :  Heart  pulse,  WINDOW:  120  S 


QQAV53P6.021 

relevant  = 
control  = 

1.3636 

1.2500 

1.3636 

1.5000 

1.3636 

1.4286 

QQAV53P6.031 

relevant  = 
control  = 

1.5000 

1.4286 

1.3636 

1.3636 

1.3043 

1.3636 

1.3636 

1.4286 

QQBQ4SHI.011 

relevant  = 
control  = 

2  2 

2  2 

2  2 

QQBQ4SHI.021 

rele\'ant  = 
control  = 

1.7647 

1.8750 

1.7647 

1.76 

1.7647 

1.8750 

QQBQ4SH1.031 

relevant  = 
control  = 

1.7647 

0.8571 

1.7647 

1.7647 

1.7647 

1.7647 

1.7647 

1.6667 

QQBSS7\\T.011 

relevant  = 
control  = 

1.5000 

1.5789 

1.5000 

1.4286 

1.5000 

1.3636 

QQBSS7WT.021 

rele\'ant  = 
control  = 

1.5000 

1.5000 

1.4286 

1.4286 

1.4286 

1.4286 

QQBSS7WT.031 

relevant  = 
control  = 

1.4286 

1.4286 

1.5000 

1.5000 

1.4286 

1.4286 

1.3636 

1.5000 

Table  1.  Fundamental  frequency  for  non-deceptive  files  for  120  seconds  for  heart  pulse. 
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FILE  NAME  FUNDAMENTAL  FREQUENCY(Hz) 

CHANNEL :  CARDIO,  WINDOW:  120  S 


QQ9SOW8L.021 

relwant  = 
control  = 

1.7647 

1.5789 

1.6667 

1.5789 

1.5789 

1.6667 

QQ9SOW8L.031 

relevant= 
control  = 

1.5789 

1.8750 

1.5789 

1.6667 

1.6667 

1.7647 

1.6667 

1.5789 

QQ9SQIK9.011 

relev’ant  = 
control  = 

1.5789 

1.5789 

1.5000 

1.5000 

1.5000 

1.5789 

QQ9SQIK9.021 

relevant  = 
control  = 

1.3043 

1.5789 

1.5789 

1.5789 

1.5789 

1.4286 

QQ9SQIK9.031 

rele\’ant  = 
control  = 

1.5000 

1.4286 

1.5000 

1.2000 

1.6667 

1.5789 

1.5789 

QQ9W0B9F.011 

relevant  = 
control  = 

1.5000 

1.4286 

1.4286 

1.5789 

1.5000 

1.5000 

QQ9W0B9F.031 

rele\'ant= 
control  = 

1.4286 

1.5000 

1.5000 

1.4286 

1.4286 

1.4286 

QQ9W0B9F.041 

relevant  = 
control  = 

1.4286 

1.4286 

1.3636 

1.3636 

1.4286 

1.5000 

QQ9U4FMU.011 

relevant  = 
control  = 

1.5789 

1.6667 

1.6667 

1.5789 

1.6667 

1.6667 

Table  2.  Fundamental  frequency  for  deceptive  files  for  120  seconds  for  heart  pulse. 
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QQ8R9OIO.011 

QQ4Q  1083.0 11 

QQ7LX5Q0.021 

QQ8RAJ0C.011 

QQ8R9OIO.021 

QQ4Q1O83.021 

QQ7LX5Q0.031 

QQ8RAJ0C.021 

QQ8R9OIO.031 

QQ4Q1O83.031 

QQ7MN2Y0.011 

QQ8RAJ0C.031 

QQ95LU1T.011 

QQ4Q3MDC.011 

QQ7MN2Y0.021 

QQ9EUKVT.011 

QQ95LU1T.021 

QQ4Q3MDC.021 

QQ7MN2Y0.031 

QQ9EUKVT.021 

QQ95LU1T.031 

QQ4Q3MDC.031 

QQ7TC5UF.011 

QQ9EUKVT.031 

QQAURNUS.021 

QQ51DE36.011 

QQ7TC5UF.021 

QQ9IOOXO.021 

QQAURNUS.031 

QQ51DE36.021 

QQ7TC5UF.031 

QQ9IOOXO.041 

QQAV53P6.011 

QQ51DE36.041 

QQ7TQVER.011 

QQ9SOW8L.011 

QQAV53P6.021 

QQ6RQGH6.011 

QQ7TQVER.021 

QQ9SOW8L.021 

QQAV53P6.031 

QQ6RQGH6.021 

QQ7TQVER.031 

QQ9SOW8L.031 

QQBQ4SHI.011 

QQ6RQGH6.031 

QQ7TVADC.011 

QQ9SQIK9.011 

QQBQ4SHI.021 

QQ6RQGH6.041 

QQ7TVADC.021 

QQ9SQIK9.021 

QQBQ4SHI.031 

QQ6T71 10.011 

QQ7TVADC.031 

QQ9SQIK9.031 

QQBSS7WT.011 

QQ6T7 110.021 

QQ7U2T4R.011 

QQ9W0B9F.011 

QQBSS7WT.021 

QQ6T7 110.031 

QQ7U2T4R.021 

QQ9W0B9F.031 

QQBSS7WT.031 

QQ6Z59IG.011 

QQ7U2T4R.031 

QQ9W0B9F.041 

QQ70XM60.021 

QQ6Z59IG.021 

QQ7YP7QU.011 

QQ9U4FMU.011 

QQ7RH0RO.011 

QQ6Z59IG.031 

QQ7YP7QU.021 

QQ9U4FMU.021 

QQ7RH0RO.021 

QQ7PP9B9.011 

QQ7YP7QU.031 

QQ9U4FMU.031 

QQ7RH0RO.031 

QQ7PP9B9.021 

QQ7YZOJ3.011 

QQ9Y_SVF.011 

QQ7R51P9.011 

QQ7PP9B9.031 

QQ7YZOJ3.021 

QQ9Y  SVF.021 

QQ7R51P9.021 

QQ7PDU1X.011 

QQ7YZOJ3.031 

QQ9Y_SVF.031 

QQ7R51P9.031 

QQ7PDU1X.021 

QQ8_0DPT.011 

QQ9YH3QF.011 

QQ9TDSP3.011 

QQ7PDU1X.031 

QQ8  0DPT.021 

QQ9YH3QF.021 

QQ9TDSP3.021 

QQ7  PIPF.Oll 

QQ8  0DPT.031 

QQ9YH3QF.031 

QQ9TDSP3.031 

QQ7  PIPF.021 

QQ8  0DPT.041 

QQA2Tr4C.011 

QQA8OWOI.011 

QQ7  PIPF.031 

QQ8  2UQ9.011 

QQA2Tr4C.021 

QQA8OWO1.021 

QQ7_JT70.011 

QQ8  2UQ9.021 

QQA2TT4C.031 

QQA8OWOI.031 

QQ7_JT70.021 

QQ8  2UQ9.031 

QQA3HIRX.011 

QQBT22O6.011 

QQ7~JT70.031 

QQ800IG6.011 

QQA3HIRX.021 

QQBT22O6.021 

QQ738DYX.011 

QQ800IG6.021 

QQA3HIRX.031 

QQBT22O6.031 

QQ738DYX.021 

QQ800IG6.031 

QQA32UTF.011 

QQB090  9.011 

QQ738DYX.031 

QQ82OIU9.011 

QQA32UTF.021 

QQBO9O_9.021 

QQ75ULP9.011 

QQ82OIU9.021 

QQA32UTF.031 

QQBO9O_9.031 

QQ75ULP9.021 

QQ82OIU9.031 

QQA6U  EF.Oll 

QQBC7PP6.011 

QQ75ULP9.031 

QQ82SUTX.011 

QQA6U  IF.031 

QQBC7PP6.021 

QQ79  EYF.Oll 

QQ82SUTX.021 

QQA6U  IF.041 

QQBC7PP6.031 

QQ79”EYF.021 

QQ82SUTX.031 

QQAM4E3L.011 

QQCHCK  0.011 

QQ79  EYF.031 

QQ860ZNU.011 

QQAM4E3L.021 

QQCHCK_O.021 

QQ7BGDML.011 

QQ860ZNU.021 

QQAM4E3L.031 

QQCHCK  0.031 

QQ7BGDML.021 

QQ860ZNU.031 

QQARF2  X.011 

QQCDTKPO.Oll 

QQ7BGDML.031 

QQ89U  ZR.011 

QQARF2  X.021 

QQCDTKP0.031 

QQ7ETC8I.011 

QQ89U  ZR.021 

QQARF2  X.031 

QQCDTKP0.041 

QQ7ETC8I.021 

QQ89U_ZR.031 

QQAW>J8X.011 

QQCM5Y56.011 

QQ7ETC8I.031 

QQ8AfU26.011 

QQAWA38X.021 

QQCQQT8Y.011 

QQ7JAQCS.011 

QQ8ATU26.021 

QQAWA38X.031 

QQCQQT8Y.021 

QQ7JAQCS.021 

QQ8ATU26.031 

QQAYXZGU.Oll 

QQCQQT8Y.031 

QQ7JAQCS.031 

QQ8FGMVI.011 

QQAYXZGU.021 

QQCQQT8Y.041 

QQ7LX5Q0.011 

QQ8FGMVI.021 

QQAYXZGU.031 

Table  3.  List  of  files  used  in  this  experiment.  50  non-deceptive  cases  and  50  deceptive 
cases  from  setl,  set2  and  set3  are  listed  in  column  1  through  4  respective 


Set 

Features 

Setl 

5 

9 

21 

23 

5 

11 

21 

23 

5 

21 

23 

29 

Set2 

5 

14 

23 

29 

S 

9 

21 

23 

5 

21 

23 

29 

Set3 

14 

19 

24 

5 

21 

23 

29 

5 

21 

23 

9 

Average 

5 

9 

21 

23 

5 

21 

23 

29 

5 

21 

23 

11 

Table  5.  The  three  best  features  of  combination  4  for  each  set  and  their  average. 
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File 

Membership 

Defuzzified 

67.0000 

0.8498 

1.0000 

68.0000 

0.6969 

1.0000 

69.0000 

0.8397 

1.0000 

Result 


70,0000 


71.0000 


72.0000 


0.2901 


0.8291 


0.3982 


1.0000 


Misclassifled 


73.0000 


74.0000 


75.0000 


1.0000 


0.2463 


0.8043 


1.0000 


1.0000 


76.0000 

0.6676 

1.0000 

77 .0000 

1.0000 

1.0000 

78.0000 

1.0000 

1.0000 

79.0000 

1.0000 

1.0000 

80.0000 

0.7538 

1.0000 

81.0000 

1.0000 

1.0000 

82.0000 

1.0000 

1.0000 

83.0000 

0.8378 

1.0000 

84.0000 

1.0000 

1.0000 

85.0000 


0.8926 


1.0000 


86.0000 

0.5448 

87.0000 

0.5751 

Misclassifled 


88.0000 


89.0000 


90.0000 


0.8273 


0.2945 


0.9110 


1.0000 


1.0000 


91.0000 

1.0000 

1.0000 

92.0000 

1.0000 

1.0000 

93.0000 


94.0000 


95.0000 


96.0000 


0.2887 


0.2079 


0.5793 


Misclassifled 


97.0000 

1.0000 

1.0000 

98.0000 

0.7971 

1.0000 

99.0000 

0.8708 

1.0000 

100.0000 


1.0000 


1.0000 


Table  6.  Continued. 
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Membership  Defuzzified 


Result 


1.0000 

0.2579 

2.0000 

0.1307 

3.0000 


4.0000 


5.0000 


6.0000 


0.2652 


0.4345 


0.1175 


7.0000 

1.0000 

1.0000 

8.0000 

0.7086 

1.0000 

14.0000 


Misclassifled 


9.0000 

0.2856 

10.0000 

0.2745 

11.0000 

0.3056 

12.0000 

0.2720 

13.0000 

0.5019 

0.8871 


1.0000 


15.0000 


16.0000 


17.0000 


0.0912 


18.0000 


0.8334 


1.0000 


Misciassified 


19.0000 


20.0000 


21.0000 


0.5483 


22.0000 


23.0000 


24.0000 

0.1535 

25.0000 

0.4955 

26.0000 

0.1013 

27.0000 


31.0000 


1.0000 


1.0000 


28.0000 

0.3788 

29.0000 

0.1638 

30.0000 

0.0905 

32.0000 

0.1431 

33.0000 

0.0937 

Table  7.  Classification  of  the  files  in  set2. 


File 


34.0000 


35.0000 


Membership  Defuzzified 


Result 


36.0000 

0.1281  1 

37,0000 

0,3690 

38,0000 

0.5734 

39.0000 

0,1569 

40,0000 

0.3659 

41.0000 

0.4124 

42,0000 

0,1704 

1  43.0000 

0,4251 

44.0000 

0.0664 

45.0000 

0.5356 

46.0000 

0,5084 

47 .0000 

0.1735 

48.0000 


0.7512 


1.0000 


49.0000 

0,5115 

50.0000 

0,0976 

51.0000 

0,6361 

1.0000 

52.0000 

0,8482 

1.0000 

53.0000 


59.0000 


0.3471 


54.0000 

0,8822 

1.0000 

55.0000 

1.0000 

1.0000 

56.0000 

1.0000 

1,0000 

57 .0000 

1,0000 

1,0000 

58,0000 

0.8730 

1,0000 

60.0000 

0.0389 

61.0000 

0.3643 

Misclassified 


62.0000 

1,0000 

1.0000 

63.0000 

0.8174 

1.0000 

64.0000 

0,8875 

1.0000 

65.0000 

0.7995 

1.0000 

66.0000 

0.5919 

0 

67.0000 

0.7533 

1.0000 

Table  7.  Continued. 
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File 

Membership 

Defuzzified 

68.0000 

0.7337 

1.0000 

69.0000 

0.8524 

1.0000 

70.0000 

0.8602 

1.0000 

71*0000 


72.0000 


73.0000 


0.2217 


1.0000 


0.1268 


1.0000 


Result 


Misclassified 


74.0000 


75.0000 


76.0000 


81.0000 


82.0000 


83.0000 


0.8860 


0.2121 


0.1684 


1.0000 


77.0000 

0.6903 

1.0000 

78.0000 

0.7680 

1.0000 

79.0000 

0.8735 

1.0000 

80.0000 

0.8013 

1.0000 

0.1748 


0.5428 


0.8496 


1.0000 


Misclassified 


Misclassified 


84.0000 


0.3444 


85.0000 

0.8298 

1.0000 

86.0000 

0.8590 

1.0000 

87.0000 

0.6879 

1.0000 

88.0000 

0.9082 

1.0000 

89.0000 

0.6653 

1.0000 

90.0000 


93.0000 


94.0000 


0.1636 


91.0000 

0.8754 

1.0000 

92.0000 

0.8594 

1.0000 

0.5185 


0.4932 


95.0000 

0.7802 

1.0000 

96.0000 

0.8684 

1.0000 

97.0000 

0.8788 

1.0000 

98.0000 

1.0000 

1.0000 

99.0000 

1.0000 

1.0000 

100.0000 

0.8669 

1.0000 

Misclassified 


Table  7.  Continued. 
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Membership  Defuzzifled 


Result 


1.0000 

0.3986 

2.0000 

0.2845 

3.0000 

0.2562 

4.0000 

0.2786 

5.0000 

0.3226 

6.0000 


7.0000 


8.0000 


9.0000 


1.0000 


0.5055 


0.1434 


1.0000 


10.0000 


11.0000 


12.0000 


13.0000 


14.0000 


0.0691 


0.4744 


0.4708 


15.0000 


16.0000 


17.0000 


18.0000 

0.4623 

19.0000 

0 

20.0000 


21.0000 


0.2096 


22.0000 


23.0000 


24.0000 

0.0516 

25.0000 

0.2885 

26.0000 

0.0981 

27.0000 


0.9336 


1.0000 


28.0000 

0.2254 

29.0000 

0.1465 

30.0000 

0.0680 

31.0000 

0 

32.0000 

0 

33.0000 

0.0939 

Table  8.  Classification  of  the  files  in  Set3. 
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File 


34.0000 


35.0000 


36.0000 


Membershi 


0.3917 


Defuzzified 


Result 


37.0000 

0.1689 

38.0000 

0.5220 

39.0000 


40.0000 


41.0000 


42.0000 


0.0969 


43.0000 

0.4810 

44.0000 

0.3154 

45.0000 

0.4552 

46.0000 

0.3285 

47  .0000 

0.3690 

48.0000 

0.5593 

49.0000 

0.3522 

50.0000 

0.2325 

51.0000 

1.0000 

1.0000 

52.0000 

0.9052 

1.0000 

53.0000 

0.8115 

1.0000 

54.0000 

0.8397 

1.0000 

55.0000 

0.8754 

1.0000 

56.0000 


0.0930 


57.0000 

0.8330 

1.0000 

58.0000 

1.0000 

1.0000 

59.0000 

1.0000 

1.0000 

60.0000 

1.0000 

1.0000 

61.0000 

1.0000 

1.0000 

62.0000 

1.0000 

1.0000 

63.0000 

0.6496 

1.0000 

64.0000 


0.5075 


65.0000 


66.0000 


67.0000 


0.0823 


0.7810 


0.2356 


1.0000 


Misclassifled 


Table  8.  Continued. 
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File 

Membership 

Defuzzified 

68.0000 

1.0000 

1.0000 

69.0000 

1.0000 

1.0000 

70.0000 

1.0000 

1.0000 

71.0000 

1.0000 

1.0000 

72.0000 

1.0000 

1.0000 

73.0000 

1.0000 

1.0000 

74.0000 

1.0000 

1.0000 

75.0000 

1.0000 

1.0000 

76.0000 

1.0000 

1.0000 

77 .0000 

1.0000 

1.0000 

78.0000 

1.0000 

1.0000 

79.0000 

1.0000 

1.0000 

80.0000 

0.6068 

1.0000 

81.0000 

0.9054 

1.0000 

Result 


82.0000 


0.4134 


83.0000 


84.0000 


85.0000 


1.0000 


0.2914 


1.0000 


Misclassified 


86.0000 

1.0000 

1.0000 

87.0000 

1.0000 

1.0000 

88.0000 

0.8786 

1.0000 

89.0000 

0.9018 

1.0000 

90.0000 

1.0000 

1.0000 

91.0000 

1.0000 

1.0000 

92.0000 

1.0000 

1.0000 

93.0000 

0.9135 

1.0000 

94.0000 

0.8292 

1.0000 

95.0000 

0.7423 

1.0000 

96.0000 

1.0000 

1.0000 

97 .0000 


0.0902 


98.0000 


99.0000 


100.0000 


0.2564 


0.4387 


Misclassified 


Table  8.  Continued. 
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function  v=armod(var,M) 

%  This  function  finds  the  autoregressive  parameter  fo  the  signal 
%  and  then  prewhitens  the  signal  using  the  prewhiten  filter. 

%  Recursive  Levinston  and  durbin  algorithm  is  used  to  find  the  AR  parameters 

%  To  use  the  function  the  user  should  enter  the  signal  and  the  AR  model  order 
%  eg  armod(variable,  model  order) 


Fs=30; 


r=xcorr(var, 'biased'); 

K=length(var); 

rx=r(K:K+M+l); 


%sampling  fi’equency 

%rx(0)  is  at  index  K 
%rx(0),rx(  1 ), .  .rx(M) 


%  Estimate  the  reflection  coefficients 

a(l,l)=l; 

P=rx(l); 


for  k=0:M-l 
accum=0; 
for  m=0:k 

accum=accum+a(k+l,m+l)*rx(k-m+2); 

end 

gamma(k+2)=-accum/P; 

P=P  *  ( 1  -abs(gamma(k+2))^2); 
a(k+2,l)=l; 

a(k+2,k+2)=gamma(k+2); 
for  m=l:k 

a(k+2,m+ 1  )=a(k+ 1  ,m+l  )+gamma(k+2)*a(k+ 1  ,k-m+2); 
end 
end 

parameter=a(M+ 1 , ; ); 

bb=[l]; 

aa=a(M+l,:); 


v=filter(aa,bb,var); 


function  freq==fundfreq(frag) 

%  This  function  called  fundfreq  (stands  for  fundamental  frequency) 

%  finds  the  fundamental  frequency  of  the  desired  signal. 

%  for  the  K  interval  of  a  question  using  autocorrelation  function. 

%  For  a  periodic  signal  with  the  period  p,  the  autocorrelation  function 
%  attains  a  maximum  at  0,p,2p,.. 

%  regardless  of  the  time  origin  of  the  signal,  the  period  can  be  estimated 
%  by  finding  the  location  the  first  maximum  in  the  autocorrelation  function. 


%For  using  this  function  the  user  should  enter  the  file  segment  fundfi’eq(frag). 


Fs  =  30;  %Sampling  frequency 

K=length(frag); 

y  =  xcorr(frag);  %  finds  the  autocorralation  function 

q  =  diff(abs(y(K:2*K-l)));  %  differentiates  the  variable 

z  =  q>0;  %  z  =  1  if  q  is  greater  than  0 

f  =  diflf(z);  Vofinds  the  indices  where  the  2nd  derivative 

%is  -1  or  +1  which  indicates  peaks  and  valleys 

peak  =  find(f<0);  %finds  the  peak  indices 


m  =K+peak; 

[i,j]=max(abs(y(m)));  %finds  the  maximum  peak  value  and  its  index 

lofreq  =find(f>=0); 

if  Iength(loffeq)=length(f) 
freq=0; 
else 

freq  =  Fs/peak(j); 
end 


function  y=croscor(varl,var2) 


%  This  function  finds  the  cross  correlation  between  two  variables 
%  The  first  variable  is  prewhitened  first  by  calling 
%  armod  (stands  for  AR  modeling)  program. 

%  The  function  returns  maximum  and  minimum  of  the  croscorrelation 
%  and  the  lag  that  these  maximum  and  minimum  happen. 

%To  use  this  command  the  user  must  enter  the  two 
%variable  names  to  be  correlated. 

% 

%  eg.  croscor(variable  1  ,variable2) 

K=min(length(var  1  ),length(var2)); 


M=  10,  %  Model  order 

v  1  =annod(var  1  ,M); 

yd=  xcorr(vl  (20;K),var2(20;K), 'biased'); 

[maximum  lagmax]=max(real(yd)); 

[minimum  lagmin]=min(real(yd)); 
y=[maximum  lagmax  minimum  lagmin]; 


function  feature=  feature(file_name, relevant, irrelevant, control,features, offset, CR_feature) 


%  This  function  produces  a  feature  vector  for  a  given  file 
%  Relevent,  irrelevent,  and  control  are  vectors  which  contain 
%  the  questions  these  features  are  extracted  from. 

% 

%  eg.  featurev(t79,[3  5],[1  4],  [6  10],feature_list) 

%  The  above  example  gives  the  features  for 
%  the  file  t79  of  the  3rd  and  5th  question  which  are  relevent  in  this 
%  MGQT  format,  the  1st  and  4th  question  which  are  irrelevent 
%  and  the  6th  and  10th  questions  which  are  control 

%  feature_list=[TOmean(ffag ) 

%  '20curve(ffag )'; 

%  '30area(frag ) ']; 


featurejist  =  features; 


%  The  channels  are  ordered  as  follows: 

%  1:GSR,  2:HiCardio,  3:LowCardio,  4;DerLowCardio,  5:LowResp,  6:UpResp 

%  This  is  a  matrix  of  the  time  delay  after  asking  a  question  to  start  of  extracting 
%  the  feature,  and  finish  extracting  the  feature  for  each  channel. 

Times=[ 

2,  14; 

3,9; 

3,  18; 

1,8; 

2,  18; 

2,  18]; 

%  These  are  preprocessing  functions. 

Preprocess=[  'detgsr'; 

'dethic'; 

'detlc '; 

'dercd '; 

'detlr '; 

'detur ']; 


dat  a=zero  s(6,  length(file_name(: ,  5 ))); 

%  Standardize  and  detrend  the  channels  and  derive  new  channels 
for  i=l;6, 

data(i,:)=eval([Preprocess(i,;),'(file_name)'])'; 

end 


marker  =  file_name(:,5);  %  0  begin  test  and  end  test 
%  0  examiner  begins  asking  question 
%  1  examiner  finishes  asking  question 
%  2  subject  begins  response  to  question 
%  9  does  not  mark  an  event 

begin  =  find(marker  =  0);  %  finds  indecies  where  marker  =  0  (question  begins) 

begin=begin(2:length(begin));  %  elliminates  the  marker  at  the  beginning  of  the  test 


%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 


%-t  4H  H  -t  -t  +4+++++4  M  I  I  I  I  I  I  M  I  ■<  M  H  H  ^  ^  H  H-+++ ++++■!■■  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I  I 
M  M  M  )  -f  +4  ■I  +++ 

%  This  for  loop  creates  feature  vectors  for  each  relevant  quesion 
% 

%  eg  X  =  [mean(gsr),std(gsr),area(gsr),mean(lr),std(lr),area(lr),etc . 

%  curve  length, amplitude  of  peaks,#  of  peaks] 

%+++++++++  M  M  I  I  I  I  M  I  I  H  I  I  M  -H  I  H  IM  I  I  4  ^  4 -H  "I  +  ++H  I  I  I  I  I  I  I  I  I  I  I 

-H-+  I  I  I  I  I  I  I  I  I  I  t")  +-H 

feature_count=l; 

for  i  =  I;max(find(relevant~=0)), 
question=relevant(i); 

for  j=l  :length(featurejist(:,l)) 

channel_number=eval(feature_list(j,  1 )); 

second_channel=eval(featureJist(i,2)); 

st=begin(question)+30*Times(channel_number,  1 ); 

fh=begin(question)+30*Times(channel_number,2); 

st2=begin(question)-30*Times(channel_number,2); 

fh2=begin(question)-3  0*Times(channel_number,  1 ); 

fr=feature_list(j,3:length(feature_Iist(l,:))); 

frag=data(channel_number,  st  :fh); 

frag2  =  data(channel_number,st2:fii2); 

if  second  channel  ~=  0 


end 


end 


st3=begin(question)+30*Times(second_channel,  1 ); 
fh3=begin(question)+30*Times(second_channel,2); 
frag3  =  data(second_channel,st3:fh3); 
end 

tempy=eval(fr); 

for  m  =  1  :length(tempy) 

x(feature_count)  =  tempy(m); 
feature_count=feature_count+ 1 ; 


O/o - 

%  Irrelevant  questions 


feature_count=l; 

for  i  =  I:(max(find(irrelevant'-=0))-offset) 
question=irrelevant(i); 
for  j=l  :length(feature_list(:,l)) 

channel_number=eval(feature_list(j,  1 )); 

second_channel=eval(feature_list(j,2)); 

st=begin(question)+3  0*Times(channeI_number,  1 ); 

fh=begin(question)+30*Times(channel_number,2); 

st2=begin(question)-30*Times(channel_nuniber,2); 

fn2=begin(question)-30*Times(channeI_number,l); 

fT=feature_list(j,3  :length(feature_list(l ,;))); 

frag=data(channel_number,st;fh); 

frag2  =  data(channel_number,st2:fh2); 

if  second_channel  ~=  0 

st3=begin(question)+30*Times(second_channel,  1 ); 
fh3=begin(question)+30*Times(second_channel,2); 
ffag3  =  data(second_channel,st3;fii3); 
end 

tenipy=eval(fr); 

for  m  =  1  :length(tempy) 

y(feature_count)  =  tempy(m); 
feature_count=feature_count+ 1 ; 
end 


end 


end 


o/o - 

%  Control  questions 

feature_count=l; 

for  i  =  1  ;niax(find(control'~=0)), 
question=control(i); 

for  j=l  :length(feature_list(:,l)) 

channel_number=eva](feature_list(j ,  1 )); 

second_channel=eval(feature_list(j,2)); 

st=begin(question)+30*Times(channel_number,  1 ); 

fh=begin(question)+30*Tinies(channel_number,2); 

st2=begin(question)-30*Tinies(channel_nuniber,2); 

fh2=begin(question)-3  0  *  Tinies(channel_number,  1 ); 

fp=feature_list0,3:length(feature_list(l,:))); 

frag=data(channel_number,st;fh); 

frag2  =  data(channel_number,st2;fh2); 

if  second_channel  ~=  0 

st3=begin(question)+3  0*Times(second_channel,  1 ); 
fh3=begin(question)+30*Times(second_channel,2); 
frag3  =  data(second_channel,st3:fh3); 
end 

tempy=eval(fr); 

for  m  =  1  :length(tempy) 

z(feature_count)  =  tempy(m); 
feature_count=feature_count+l ; 
end 
end 
end 

o/o - 


%  control  &  relevant 


feature_count=l; 

for  i  =  I;max(find(relevant~=0)), 

for  k=l  :max(find(control~=0)), 

q(k)=abs(relevant(i)-control(k)); 

end 


[a  b]=min(q); 


question  1  =relevant(i); 
question2=control(b); 

for  j=l  :length(CR_feature(:,l)) 

channel_number=eval(CR_feature(j,  1 )); 

st=begin(questionl  )+30*Times(channel_number,  1 ); 

fii=begin(questionl)+30*Tinies(channel_number,2); 

st2=begin(question2)+3  0*Times(channel_number,  1 ); 

fii2=begin(question2)+30*Times(channel_number,2); 

fr=CR_feature(j,3  ;length(CR_feature(  1 ,;))); 

fragl=data(channel_number,st;fii); 

ffag2=data(channel_number,st2:fii2); 

tempy=eval(fr); 

for  m  =  l:length(tempy) 

w(feature_count)  =  tempy(m); 
feature_count=feature_count+ 1 ; 
end 
end 
end 


feature=[x,y,z,w]'; 


function  isd_dif^isd(fragl,frag2) 


%  This  is  a  integrated  spectral  difference(isd)  function  that  finds  the  cumulativespectral 
%  density  of  a  control-relevant  pair,  then  calculates  the  difference  between  the 
%  isd  of  control  and  the  relevant  for  a  part  of  a  question. 

%  This  function  returns  the  max  or  min  and  the  frequency  (points) 

%  where  this  max  or  min  happens  and  the  area  underneath  this  difference. 

%  To  use  this  command  the  user  must  enter  the  two  variable  names. 

%  The  first  variable  is  a  control  question  fragment  and  the  second  is 
%  a  relevant  question  fragment. 

%  eg .  isd  1  (variable  1  ,variable2) 

Fs  =  30; 

K=min(length(frag  1  ),length(frag2)); 


nnp  =1; 
np  =  2'^nnp; 

L  =  K/np; 

L=2^(nextpow2(L)); 

M=  spectrum  (fragl,L); 
N=  spectrum  (frag2,L); 

pqc  =  cumsum(M(:,l)); 
pqr  =  cumsum(N(:,l)); 


%spectral  density  of  the  first  (control)  question 
%spectral  density  of  the  second(relevant)  question 

%Cumulative  sum  of  the  integrated  spectral  density 
%Cumulative  sum  of  the  integrated  spectral  density 


clear  M 
clear  N 

he  =  pqc/pqc(L/2); 
hr  =  pqr/pqr(L/2); 


CR_dif=  hr’  -  he'; 

if  (abs(max(CR_dif))>abs(min(CR_dif))) 
[CR_dif,  mpoint]=max(CR_dif); 
else 

[CR_dif  ,mpoint]=min(CR_dif); 
end 

isd_dif=[  CR_dif  mpoint  trapz(hr'-hc')]; 


'lOmean(frag)  ' 

'10curve(frag)  ’ 

'lOarea(frag)  ' 

'10med_dif(frag,8)  ' 

'10inax_inin(frag)  ' 

'10max(firag)  ' 

’lOmin(frag)  ' 

'10mdif(frag)  ' 

'20mean(frag)  ' 

’20auve(frag)  ' 

'20area(&ag)  ' 

'20ampcard(fiag)  ' 

'20dainpcard(frag)  ' 

'20peaknunic(&ag)  ’ 

'20med_dif(frag,5)  ’ 

'20inax_inin(frag)  ' 

'20inax(frag) 
'SOminCfrag)  ' 

'20min(frag)  ' 

'20mdif(frag)  ' 

'20minampc(frag)  ' 

’30mean(frag)  ' 

'30curv'e(frag)  ' 

’30area(frag)  ' 

'30med_dif(frag,5)  ' 

'30max_min(frag)  ' 

'30max(frag)  ' 

'30mdif(frag)  ' 

'40mean(fTag)  ' 

'40min(frag)  ' 

'40mdif(frag)  ' 

'40cur\’e(frag)  ’ 

’40area(frag)  ' 

'40med_dif(frag,5)  ' 

'40max_niin(frag)  ' 

’40max(fTag)  ’ 

'50mean(frag)  ' 

’50curve(frag)  * 

’50area(frag)  ’ 

'50ainpr(frag)  ' 

'50peaknumr(frag)  ' 

'50ie(frag)  ' 

’50dampr(frag)  ' 

’50ieie(frag,  frag2)  ’ 

'50med_dif(frag,8)  ' 

'50max_min(frag)  ' 

'50inax(fTag)  ' 

'50min(frag)  ' 

•50mdif(frag) 
'50ininampr(frag)  ’ 

‘60mean(ftag)  ' 


i  4  » 


»  * 


'60cun'e(frag) 

'60area(frag) 

'60ampr(frag) 

’60dampr(frag) 

'60peaknumr(frag) 

'60ie(fTag) 

'60ieie(frag,  fTag2) 
'60med_dif(frag,8) 
'60max_inin(frag) 

'60inax(fTag) 

'60min(fTag) 

'60mdif(frag) 

'60minampr(fTag)  *; 

'lOstd(frag) 

’20std(frag) 

'30std(frag) 

'40std(frag) 

'50std(frag)  *; 

'60std(frag) 

'20annodl(frag) 

'20corl(frag) 

'50corl(ftag) 
'26croscor(frag,fTag3) 
’26psdcohl(frag,frag3)  ']; 

CR_feature=[ 

*10isdl(fragl,frag2) 
'20isdl(fragl,frag2)  ']; 


lf=length(feature_list(;,l)); 
cd  \mgqt\gl 
files! 
for  d=l:3 

ifd=2 

cd  \mgqt\g2 
files2 

el  seif  d=3 

cd  \mgqt\non_dec 
filesn 
end 

for  k=l;Iength(flist(;,l)) 
file_name=[flist(k, ;)] ; 
flength=length(file_name); 

question=['ZZ',num2str(file_name(3  :flength- 1  )),'4']; 

%  creates  the  name  of  the  file  that  holds  the  questions(zz*.014) . 


! 


eval(['load file_nanie]);  %  load  the  data  &  the  file  with  the 

eval(['load  question]);  %  question  number 

file_name=file_name(l  :flength-4);  %eleminates  the  extention(.013) 
question=question(l  :flength-4) ;  %  in  order  to  use  the  data. 

Q=eval(question); 

l_rel=max(find(Q(2,;)~=0));  %The  length  of  relevant  questions 

l_con=max(find(Q(4,:)--=0));  %The  length  of  control  questions 

l_irr=max(find(Q(3,:)*^));  %The  length  of  irrelevant  questions 

qover  =l_con+i_rel+i_irr-10;  %  finds  the  number  of  questions  over  10 

offset=qover*(qover>0); 

CRIength=l_rel*6; 

size_M=(10+(qover<0)*qover)*(If+18)+CRlength;  %total  size  of  features 

initial=zeros(10*(18+lf)+30,l);  %Initializing  M  with  a  10*lf  zeros 
M(;,k)=initiai; 

M(l:size_M,k)=feature(eval(file_name),[Q(2,:)],[Q(3,:)],[Q(4,:)],feature_list, offset, C 
R_feature); 

eval(['clear  ',upper(file_name)]) 
eval(['clear  ',upper(question)]) 


end 


save  new_feat  M  If  flist 
clear  M 


clear 

featlength=23; 
load  new_feat 
for  k=l:length(flist(;,l)) 

file_name=[flist(lc, :)]; 
flength=!ength(file_name); 
question=['ZZ',num2str(file_name(3  iflength- 1  )),'4']; 

eval(['load  '.question]);  %  load  the  file  with  the  question  numbers. 

Q=eval(question(  1  :flength-4));  %  in  order  to  use  the  data. 
l_rel=max(find(Q(2,:)--=0));  %The  length  of  relevant  questions 

l_con=max(find(Q(4,:)--=0));  %The  length  of  control  questions 

l_irr=max(find(Q(3,  :)~=0));  %The  length  of  irrelevant  questions 


%  Averaging  relevant  questions 
for  j=l  ;lf-5+featlength 
m=a-l)*7; 
clear  r 
for  i=l  :l_rel 

r(i)=M((i-l)*(lf-5+featlength)+j,k); 

end 


feat_vec(m+l  ,k)=mean(r); 
feat_vec(m+2,k)=mean(r); 
feat_vec(m+3,k)=max(r); 
feat_vec(m+4,  k)=min(r); 
feat_vec(m+5,k)=max(r); 
feat_vec(m+6,k)=niin(r); 
feat_vec(m+7,  k)=max(r); 
end 

qover  =l_con+l_rel+l_irr-10 ; 
ofFset=qover  *  (qover>0); 
l=(l_irr-offset+l_rel)*(lfi5+featlength); 
crJ=l+l_con*(lf-5+featlength); 


%finds  the  feature  values 
%for  all  the  relevant  questions. 

%retums  mean  value  for  relevant 


%The  number  of  questions  over  10 

%The  position  of  the 
%first  control  question 


%■ 


%  Averaging  control  questions 

for  j= 1  ;lf-5+featlength 
clear  c 
m=(j-l)*7; 
for  i=l  ;l_con 

c(i)=M((i-l)*(lf-5+featlength)+j+l,k); 


%finds  the  feature  values  for 


end 


%all  the  control  questions. 


%feature  values  for  control  questions 

f(m+ 1 ,  k)=feat_vec(ni+ 1  ,k)-niean(c); 

if  (feat_vec(ni+2,k)+mean(c)==0) 
f(m+2,k)=100; 
else 

f(ni+2,k)=2*(feat_vec(m+2,k)- 

inean(c))/(feat_vec(m+2,k)+mean(c));  %for  every  feature, 
end 

f(m+3,k)=feat_vec(m+3,k)-max(c); 
f(m+4,k)=feat_vec(m+4,k)-niin(c); 
f(m+5,k)=feat_vec(m+5,k)-min(c); 
f(m+6,k)=feat_vec(m+6,k)-max(c); 
if  max(c)=0 

f(m+7,k)=100; 

else 

f(m+7,k)=feat_vec(ni+7,k)/max(c); 

end 

end 


o/o - 

%  feature  values  for  control_relevant 

for  j=l;6 

m=(j-ir3; 
clear  cr 
for  i=l  :l_rel 

cr(i)=M((i-l)*6+j+cr_l,k); 

end 

f(m+ 1 +(lf-5+featlength)*  7,k)=mean(cr); 
fl^m+2+(lf-5+featlength)*7,k)=max(cr); 
f(m+3+(lf-5+featlength)*7,k)=min(cr); 
end 

decep(  1  ,k)=Q(  1:1);  %  finds  if  file  is  deceptive  or  not 

%  creates  1  if  deceptive  and  0  if  not. 
eval(['clear  ',upper(question(l  :flength-4))]); 
end 


save  fh_dec  f  decep 
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I.  Introduction 


Polygraph  examinations  are  the  most  widely  used  method  to  distinguish  between  truth  and 
deception.  In  a  Polygraph  examination  a  person  is  connected  to  a  special  instrument 
called  a  Polygraph  which  records  several  physiological  signals  such  as  blood  pressure. 
Galvanic  Skin  Response,  and  respiration.  The  subject  is  asked  a  set  of  questions  by  an 
examiner.  By  looking  at  these  signals  the  examiner  is  able  to  determine  the  reactions  of 
the  subject  to  the  questions  and  decide  whether  the  person  was  truthful  or  deceptive  in 
answering  each  question.  The  problem  with  human  classification  of  Polygraph  tests  is  that 
the  outcome  depends  on  the  examiner's  experience  and  personal  opinion.  Automatic 
scoring  of  Polygraph  tests  has  been  a  subject  of  extensive  research.  Several  methods  for 
Polygraph  classification  have  been  studied  which  are  mostly  based  on  statistical 
classification  techniques. 

In  this  study  two  main  goals  were  presented.  The  first  goal  was  finding  appropriate 
features  which  have  physiological  basis.  The  second  purpose  was  trying  a  new 
classification  method  based  on  fuzzy  set  theory.  The  advantage  of  using  fuzzy  logic  is  that 
the  it  does  not  simply  assigns  each  input  to  one  of  the  classes,  but  it  gives  the  possibility  of 
belonging  of  an  input  to  each  class. 

Digitized  Polygraph  data  used  in  this  project  were  collected  from  various  police  stations. 
The  data  files  were  organized  according  to  the  test  format  used  and  were  decoded  to 
ASCII  format  so  they  can  be  read  by  Matlab.  Preprocessing  and  feature  extraction 
routines  were  implemented  in  the  Matlab  language.  Three  sets  of  files  were  chosen,  each 
one  of  them  contained  50  deceptive  and  50  non-deceptive  files.  These  files  are  listed  in 
Table  10  in  Appendix  A.  A  set  of  features  were  selected  based  on  physiological  reactions, 
and  the  feature  vectors  for  every  file  in  each  set  were  found.  Different  classification 
methods  were  studied  and  a  Fuzzy  K-nearest  neighbor  classifier  was  selected. 

Significance  of  each  feature  was  examined  according  to  the  clustering  and  correct 
classification  obtained  by  using  that  individual  feature.  Thirty  features  were  selected  as 
the  final  set  of  features  and  a  subset  of  combinations  of  2  to  4  of  these  features  were 
examined  to  study  the  effects  of  combining  the  features  on  classification  results.  The 
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combination  that  produced  the  best  classification  for  all  three  sets  on  the  average  was 
selected  and  the  effects  of  changing  the  classifier  parameters  on  classification  was  studied. 
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II.  Polygraphs* 


A  polygraph  examination  is  the  most  popular  method  used  to  determine  if  an  indi\ddual  is 
being  truthful  or  deceptive.  During  an  examination,  a  subject  is  asked  a  series  of  questions 
and  the  physiological  responses  to  the  questions  are  recorded  using  a  polygraph.  The 
three  physical  responses  currently  obtained  from  a  polygraph  examinations  are  blood 
pressure,  respiration,  and  skin  conductivity.  Polygraph  charts  are  usually  analyzed  by  a 
human  interpreter  for  evidence  of  truth  or  deception;  however,  computer  algorithms  are 
now  being  used  to  verify  results  [1][2]. 


11.1.  History 

The  first  attempt  to  use  a  scientific  instrument  in  an  effort  to  detect  deception  occurred 
around  1 895  [3].  That  was  the  year  that  Caesar  Lombroso  published  the  results  of  his 
experiments  in  which  a  hydro sphygmograph  was  used  to  measure  the  blood  pressure-pulse 
changes  of  criminals  in  order  to  determine  whether  or  not  they  were  deceptive.  Although 
the  hydro  sphygmograph  was  originally  intended  to  be  used  for  medical  purposes, 
Lombroso  found  that  it  worked  well  for  lie  detection.  Lombroso  may  have  been  the  first 
to  use  a  peak  of  tension  test  format.  This  was  done  by  showing  a  suspect  a  series  of 
photographs  of  children,  one  being  the  victim  of  sexual  assault.  If  the  suspect  did  not 
react  more  to  the  victims  picture  than  the  pictures  of  the  other  children,  Lombroso 
concluded  that  the  suspect  did  not  know  what  the  victim  looked  like  and  therefore  was  not 
the  alleged  perpetrator. 

In  1914  Vittorio  Benussi  published  his  research  on  predicting  deception  by  measuring 
recorded  respiration  tracings  [4].  He  found  that  if  the  length  of  inspiration  were  divide  by 
the  length  of  expiration,  the  ratio  would  be  larger  after  lying  than  before  lying  and  also 
before  telling  the  truth  than  after  telling  the  truth.  In  1921  John  A.  Larson  constructed  an 
instrument  capable  of  simultaneously  recording  blood  pressure  pulse  and  respiration 
during  an  examination  [3]  [4].  Larson  reported  accurate  results  which  prompted  Leonarde 
Keeler  to  construct  a  better  version  of  this  instrument  in  1926  [3][4]. 


*  This  section  is  exerpted  from  [17] 
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The  use  of  galvanic  skin  response  in  lie  detection  began  during  the  turn  of  the  century.  It's 
usefulness,  however,  did  not  become  evident  until  the  1930's  during  which  time  several 
articles  written  by  Father  Walter  G.  Summers  of  Fordham  University  in  New  York  [4], 

In  these  articles  he  reports  over  90  criminal  cases  in  which  examination  using  the  galvanic 
skin  response  had  all  been  successful  and  confirmed  by  confession  or  supplementary 
evidence.  The  usefulness  of  the  galvanic  skin  response  prompted  Keeler  to  add  an 
galvanometer  to  his  polygraph.  At  the  time  of  Keelers  death  in  1949,  the  Keeler 
Polygraph  recorded  blood  pressure-pulse,  respiration,  and  galvanic  skin  response  [3]. 


II.2  Modern  Test  Formats 


The  effectiveness  of  a  polygraph  examination  is  often  the  result  of  the  test  format  that  is 
used.  A  polygraph  test  format  consists  of  an  ordered  combination  of  relevant  questions 
about  an  issue,  control  questions  that  provide  a  physical  response  for  comparison,  and 
irrelevant  questions  that  also  provide  a  response  or  the  lack  of  a  response  for  comparison 
[1][4].  Three  general  types  of  test  formats  are  in  use  today.  These  are  Control  Question 
Tests,  Relevantrirrelevant  Tests,  and  Concealed  Knowledge  Tests.  Each  of  the  general 
test  formats  may  have  a  number  of  more  specific  variations.  Each  test  consists  of  two  to 
five  charts  containing  a  prescribed  series  of  questions.  The  test  format  that  is  used  in  an 
examination  is  determined  by  the  test  objective  [3][4]. 

The  concealed  knowledge  test,  also  called  peak  of  tension  test,  is  used  when  facts  about  a 
crime  are  known  only  by  the  investigators  and  not  by  the  public.  In  this  case,  a  subject 
would  not  know  the  facts  unless  he  or  she  was  guilty  of  the  crime.  For  example,  if  a  gun 
was  used  in  a  crime  and  the  public  did  not  know  the  caliber,  an  examiner  could  ask  a 
suspect  if  it  was  a  22  caliber ,  a  38  caliber,  or  a  9  mm.  If  the  gun  used  was  a  9  mm  and 
the  suspect  was  deceptive,  a  polygraph  chart  would  probably  indicate  evidence  of 
deception. 

A  control  question  test  is  often  used  in  criminal  investigations.  In  this  type  of  test  a  series 
of  relevant,  irrelevant,  and  control  questions  are  asked.  A  relevant  question  is  one  which 
is  specific  to  the  crime  being  investigated.  For  example, "  Did  you  steal  the  money?".  A 
control  question  is  designed  to  make  the  subject  feel  uncomfortable.  It  is  not  specific  to 
the  crime  being  investigated  however  it  may  be  related  in  an  indirect  way.  A  control 
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question  that  could  follow  the  relevant  question  stated  above  is  "Have  you  ever  taken 
anything  that  did  not  belong  to  you?".  The  control  questions  are  compared  to  the  relevant 
questions  and  if  the  responses  to  the  relevant  questions  are  greater,  the  subject  is  usually 
classified  as  deceptive.  Irrelevant  questions  are  used  as  buffers.  Examples  of  irrelevant 
questions  are  "Are  the  lights  in  this  room  on?"  or  "Is  today  Monday?". 

Relevant-Irrelevant  tests  are  usually  used  to  test  people  trying  to  obtain  security  clearance 
or  get  a  job.  In  this  test,  relevant  questions  are  compared  to  irrelevant  questions.  Very 
few  control  questions  are  asked.  The  purpose  of  control  questions  in  this  test  is  to  make 
sure  that  the  subject  is  capable  of  reacting  at  all. 


II.3  Present  Day  Equipment 


The  most  popular  polygraph  machines  today  are  the  Reid  Polygraph  developed  in  1945 
and  the  Axciton  Systems  computerized  polygraph  developed  in  1989  [1][1 1].  The  Reid 
polygraph  scrolls  a  piece  of  paper  under  pens  that  record  the  biological  signals.  The 
Axciton  polygraph  digitizes  physiological  signals  and  uses  a  computer  to  process  them. 
The  sampling  frequency  of  the  Axciton  machine  is  30  Hz.  Axciton  provides  a  computer 
based  system  for  ranking  the  subject  responses  but  allows  printouts  of  the  charts  to  be 
scored  by  hand  the  traditional  way.  Both  machines  record  the  same  biological  signals 
using  standard  methods.  Blood  pressure  is  measured  by  placing  a  standard  blood  pressure 
cuff  on  the  arm  over  the  brachial  artery.  Respiration  is  monitored  by  placing  rubber  tubes 
around  the  abdominal  area  and  the  chest  of  the  subject.  This  results  in  two  signals,  an 
upper  and  lower  respiratory  signal.  Skin  conductivity  is  measured  by  placing  electrodes 
on  two  fingers  of  the  same  hand. 
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III.  Feature  Extraction  and  Classification 


III.l  Introduction 


The  problem  of  Classification  of  Polygraph  data  like  other  pattern  recognition  problems 
can  be  considered  of  consisting  of  several  main  stages.  Figure  [1]  shows  these  stages  and 
the  relationship  between  them.  At  the  beginning  data  is  preprocessed  so  that  noise  and 
redundancies  are  removed  from  data  and  feature  extraction  can  be  done  more  accurately. 
The  next  stage  is  feature  extraction.  In  this  step  data  is  read  and  appropriate  features  are 
extracted  from  it.  This  is  a  very  important  step  in  all  pattern  recognition  problems, 
because  the  purpose  of  pattern  recognition  is  finding  similarities  in  data  that  belong  to  the 
same  class,  and  features  are  elements  that  represent  these  similarities.  Therefore,  a  good 
set  of  features  can  lead  to  good  classification  whereas  a  satisfactory  result  cannot  be 
achieved  with  an  inappropriate  set  of  features.  Having  a  set  of  features,  the  next  step  is  to 
use  a  method  to  classify  data  using  these  features.  These  steps  as  applied  to  Polygraph 
classification  are  described  in  more  details  in  the  following  sections. 


POLYGRAPH  CLASSIFICATION 


Figure  1 
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III.2.  Preprocessing 


Polygraph  data  consists  of  signals  from  four  different  channels;  galvanic  skin  response 
(GSR),  blood  pressure,  higher  respiration,  and  lower  respiration.  First  blood  pressure 
signal  was  decomposed  into  a  high  frequency  component  showing  heart  pulse,  and  a  low 
frequency  component  showing  blood  volume.  Derivative  of  the  blood  volume  channel 
was  taken  and  used  as  another  channel.  These  six  derived  signals  were  detrended  and 
filtered.  For  more  details  on  preprocessing  refer  to  [17]. 
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III.3.  Feature  Extraction 


In  this  step  appropriate  features  are  selected  and  extracted.  Feature  extraction  is  itself 
divided  into  several  steps.  Figure  [2]  shows  different  stages  involved  in  feature  extraction. 

By  feature  gathering  we  mean  selecting  features  that  might  have  useful  information  in 
them.  Feature  Combination  is  a  special  step  in  polygraph  classification.  In  this  step 
features  derived  for  different  questions  in  a  test  are  combined  to  build  a  single  feature, 
feature  selection  is  a  step  in  which  a  small  number  of  features  is  selected  fi’om  the  main 
feature  set  to  be  used  in  final  classifier  section. 


Figure  2 
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III.3.1.  Feature  Gathering 


Features  that  possibly  convey  some  information  in  them  were  selected  and  extracted  in  this 
stage.  Literature  about  Polygraph  were  studied  and  several  Polygraph  examiners  were 
interviewed  to  find  out  what  had  been  done  about  this  problem  and  what  characteristics  in 
a  signal  are  used  as  indicators  of  truth  or  deception.  In  general  features  are  divided  into 
three  main  groups,  time  domain  features,  frequency  domain  features  and  correlation 
features.  Time  domain  features  are  mostly  standard  characteristics  like  mean,  standard 
deviation,  median  and  so  on.  Some  more  specific  time  domain  features  were  also  added, 
such  as  the  ratio  between  inhalation  and  exhalation.  Auto  Regressive  parameters  were 
also  extracted  and  tried  as  features.  To  extract  each  feature  for  each  question  a  time 
frame  was  considered  that  started  with  a  specific  delay  after  each  question  was  asked  and 
lasted  for  a  specific  amount  of  time.  Different  time  frames  were  used  for  different 
channels  because  each  channel  represents  a  different  physiological  parameter.  Frequency 
domain  features  include  fundamental  frequency,  magnitude  of  power  spectral  density  at 
fundamental  frequency,  coherency  at  fundamental  frequency  and  so  on.  Figure  3  shows 
the  feature  gathering  and  the  decisions  that  involved  in  this  step. 


Figure.  3 


10 


For  every  question  in  a  test  93  features  were  selected  and  extracted  .  Also  6  Integrated 
Spectral  Density  features  were  used  which  directly  compare  each  relevant  question  to  the 
nearest  control  question.  The  total  number  of  features  derived  for  each  test  was  : 

93x10+6x5  =  960 

This  was  repeated  for  all  the  tests  in  feature  sets  1, 2  and  3.  The  results  of  each  set  were 
saved  in  a  960x100  matrix  called  the  M  matrix. 

For  a  detailed  description  of  time  domain  features  and  frequency  domain  features  refer 
respectively  to  [17]  and  [16], 


III.3.2.  Feature  Combination 


As  mentioned  earlier  each  feature  is  extracted  for  all  questions  in  a  test,  that  is  for 
relevant,  irrelevant,  and  control  questions.  In  a  polygraph  test  responses  to  relevant 
questions  are  compared  to  responses  to  irrelevant  and  control  questions.  But  in  any  test 
there  are  several  questions  of  each  type  and  many  methods  can  be  used  to  combine  them. 
Figure  [4]  shows  different  methods  to  combine  the  features.  It  was  decided  not  to  use 
irrelevant  questions  in  this  study,  because  in  a  Controlled  Question  Polygraph  Test 
comparison  between  the  responses  to  relevant  and  control  questions  is  the  most  important 
factor.  For  most  of  the  features  seven  methods  were  tried  to  combine  features  of  different 
questions  in  a  test.  For  the  last  six  features  three  ways  to  combine  them  were  tried.  These 
methods  were  finding  the  average,  maximum  and  minimum  of  relevant-control  pairs.  The 
first  93  features  combined  in  seven  ways  and  six  integrated  spectral  density  features  were 
combined  in  three  ways  so  the  total  number  of  features  at  this  stage  was  equal  to: 

("93x7;  +  ('6x3;  =  669 
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Figure  4 


III.3.3  Feature  Selection 

Feature  selection  was  done  in  two  independent  steps,  reduction  and  combination.  Figure 
[5]  shows  the  relationship  of  these  two  steps.  These  two  steps  are  explained  in  the 
following  two  sections. 


Figure.  5 


III.3.3.1  Feature  Selection  (Reduction) 


The  next  step  in  our  Feature  Extraction  was  to  reduce  the  number  of  features  to  a  number 
so  that  a  practical  algorithm  can  be  used  to  select  the  feature  set  from  them.  It  was 
decided  to  bring  down  the  number  of  features  from  669  to  30  at  this  step.  Two  different 
methods  were  chosen  to  test  the  features  one  at  time  to  find  the  best  30.  The  first  method 
was  using  the  KNN  classifier  to  classify  the  data  files  using  one  feature  at  a  time.  It  was 
decided  to  use  a  Fuzzy  version  of  K-nearest  neighbor  algorithm.  The  value  5  was  selected 
for  the  K  because  it  seemed  that  it  gave  better  results  than  the  other  values  for  1  feature 
classification  .  Also  a  threshold  of  0.5  was  used  to  defuzzify  the  output  of  the  classifier. 
Refer  to  the  section  on  classification  for  the  reason  of  choosing  this  classifier.  The  second 
method  was  using  the  scatter  criterion  is  given  below. 

(1) 

S'  +5; 

w,  =  mean  of  class  i,  5,  =  standard  deviation  of  class  i 

This  criterion  measures  the  distance  between  the  means  of  the  two  classes,  normalized 
over  the  sum  of  the  variances.  Therefore  the  more  compactly  the  samples  in  each  class  re 
separated,  the  higher  will  be  the  value  of  J. 

The  two  methods  were  run  on  three  sets  of  data.  At  this  point  a  method  was  needed  to 
choose  the  features.  Different  methods  are  possible  for  this  step.  The  method  that  was 
followed  is  shown  in  figure  [6]  and  explained  below.  • 

At  first  the  results  of  KNN  and  scatter  criterions  were  averaged  for  3  sets  of  data  so  that 
features  that  work  well  for  all  data  sets  would  be  selected.  As  mentioned  in  an  earlier 
section  for  Basic  features  1  to  93,  7  features  and  for  the  features  94  to  99,  3  features  were 
derived.  Because  these  features  are  derived  from  one  basic  feature  and  are  strongly 
correlated,  it  was  decided  to  choose  only  one  from  them.  So  the  best  feature  from  these 
sets  of  3  or  7  was  selected,  and  the  results  were  sorted. 
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Two  sets  of  30  features  were  found  using  the  above  mentioned  criterions.  The  next  step 
was  choosing  30  features  from  these  60.  This  was  done  by  examining  the  tables  and 
selecting  the  features  that  showed  a  good  performance  in  both  cases  or  had  a  special 
physical  meaning. 

This  set  of  features  is  the  final  set  used  for  examining  and  selection.  Table  1  in  Appendix 
A  shows  these  features  with  their  corresponding  meaning,  channel  used  to  derive  the 
feature,  and  the  method  to  combine  the  features  for  different  questions. 
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m.3.3.2  Feature  Selection  (Combination) 


The  number  of  features  was  reduced  to  30  in  the  Feature  Reduction  step.  This  number 
should  be  further  reduced  because  there  is  100  samples  in  each  data  file,  and  using  30 
features  in  a  classifier  might  give  very  good  results  for  that  particular  data  set,  but  it  wont 
be  able  to  generalize.  At  this  step  measuring  the  performance  of  individual  features  is  not 
a  very  logical  method.  Because  for  example  features  'A'  and  ''B'  might  be  good  features 
individually,  but  combining  them  might  not  necessarily  give  better  results.  Whereas 
feature  'C  that  might  not  be  a  very  good  feature  by  itself  might  improve  the  classification 
if  combined  with  feature  'A'. 

Therefore  the  combinations  of  the  features  should  be  examined.  Many  methods  are 
suggested  to  solve  this  problem.  The  most  basic  way  is  exhaustive  search.  That  is  tiying 
all  the  combinations  for  these  features.  It  is  obvious  that  this  is  not  practical  when  the 
number  of  features  is  not  very  small.  For  example  choosing  10  or  less  features  fi’om  a  set 
of  30  and  trying  all  the  different  combinations  needs 


10  10 

1=1  1=1 


30.' 


i!(30-i)! 


»10* 


computations. 

The  method  that  was  chosen  was  to  start  with  all  the  combinations  of  two,  find  the  best  N 
ones  among  them,  and  use  only  these  combinations  to  combine  features  in  sets  of  3.  Then 
again  find  the  best  combinations  of  3  and  use  them  in  combinations  of  4  features. 

This  procedure  is  continued  until  satisfactory  results  are  gained  or  features  are  not 
improved  by  increasing  the  number  of  features.  Figure  [7]  shows  the  algorithm  for  this 

step. 
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All  pairwise  combinations  of  the  features  were  tried  to  see  the  classification  results.  The 
classifier  used  was  Fuzzy  K-nearest  neighbor  with  a  threshold  of  0.5,  and  K=5.  This  was 
done  for  three  sets  of  features.  The  results  were  sorted  and  30  best  combinations  for  each 
set  were  found.  Also  the  results  of  classification  for  each  combination  for  the  3  sets  was 
averaged  and  the  30  combinations  that  gave  best  results  on  the  average  were  found. 

These  combinations  are  shown  in  Table  2  in  Appendix  A. 

It  was  decided  to  select  20  sets  of  pairwise  combinations  to  use  in  combinations  of  3. 
Results  for  sets  1-3  and  Average  were  studied  and  combinations  that  showed  a  good 
result  in  one  of  the  sets  or  had  a  good  average  were  selected.  Table  3  in  Appendix  A 
shows  these  combinations. 

The  same  steps  were  repeated  to  study  the  combinations  of  3  and  4  features.  The  results 
are  shown  in  Tables  4  and  6  in  Appendix  A.  Because  of  time  limitations  it  was  decided 
not  to  go  further  from  combinations  of  4  features. 
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III.3.4  Discussion  about  the  results: 


The  classification  results  improved  consistently  by  increasing  the  number  of  features  fi’om 
one  to  four.  The  features  that  showed  the  best  result  for  the  three  sets  were  features  {5, 

9,  21,  23}  with  81  percent  correct  classification.  These  features  represent  Maximum  Of 
GSR,  Difference  between  Maximum  and  Minimum  of  High  Cardio,  Maximum  of  Lower 
Respiratory,  and  the  Difference  between  Maximum  and  Minimum  of  Upper  Respiratory. 
These  features  show  approximately  the  same  classification  results  for  all  three  sets  which 
is  81  percent. 

Other  combinations  of  features  also  gave  comparable  results.  For  example  (5,  21,  23,  29} 
and  {5,  11,  21,  23},  and  {5,  10,  21,  23}.  Note  the  repetition  of  {5,  21,  23}.  Refer  to  the 
table  1  in  Appendix  A  for  a  meaningful  listing  of  the  features.  It  is  very  notable  that 
feature  sets  that  show  the  best  classification  results  has  features  that  come  fi'om  different 
channels.  It  can  be  concluded  that  signals  from  different  physiological  channels  convey 
independent  information,  so  that  using  features  extracted  from  them  improves  the 
classification. 

Another  point  to  notice  is  that  data  set  three  shows  better  classification  results  than  the 
two  other  sets,  87  percent  versus  81  percent  for  the  sets  one  and  two.  The  feature  set  that 
gives  the  best  result  for  data  set  three  is  (9,  14,  19,  24}.  This  feature  set  gives  87.4 
percent  correct  classification  for  data  set  three.  The  feature  set  {5,  9,  21,  23}  that  gives 
the  best  classification  on  the  average,  has  approximately  the  same  results  for  all  three  sets, 
81  percent.  The  polygraph  tests  that  were  used  in  this  project  came  fi'om  several  sources 
and  were  done  by  different  examiners  that  used  slightly  different  methods.  Fifty 
consecutive  tests  were  used  to  build  each  data  set.  So  it  is  possible  that  some 
characteristic  exists  in  the  deceptive  files  of  data  set  three  that  results  in  better 
classification.  This  is  a  matter  of  future  investigation. 
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III.4.  Classification 


The  classifier  is  the  final  stage  in  a  pattern  recognition  system.  The  inputs  to  the  classifier 
are  usually  a  set  of  feature  vectors.  The  classifier  ordinarily  assigns  each  input  to  one  of 
the  classes.  There  are  many  methods  to  design  a  classifier.  The  classifier  could  be 
designed  after  studying  the  distribution  of  samples  of  each  class,  or  a  learning 
classification  algorithm  can  be  implemented.  We  were  not  sure  about  the  shape  of 
clustering  and  the  distribution  of  samples  for  deceptive  and  non  deceptive  classes,  and  it 
was  possible  that  samples  for  one  class  cluster  around  more  than  one  point  in  space.  It 
was  decided  to  use  the  K-nearest  neighbor  classifier*  in  this  project  because  it  does  not 
explicitly  use  the  distribution  of  the  samples. 

One  of  the  characteristics  of  the  conventional  classification  methods  is  that  they  assign 
each  input  to  one  of  the  possible  classes  (crisp  Classification)  or  find  probability 
distributions  of  belongingnesses  of  the  inputs  to  the  classes.  While  the  way  that  humans 
think  and  classify  objects  is  fundamentally  different.  Each  object  can  be  considered  to 
belong  to  more  than  one  class  at  the  same  time,  and  there  are  degrees  of  belongingness  for 
each  class.  This  is  the  basic  idea  that  is  followed  in  Fuzzy  Logic.  It  was  decided  to  follow 
a  Fuzzy  Logic  based  classifier  in  this  project,  because  the  output  will  be  the  possibility  of 
deception  and  a  person  will  not  be  considered  completely  deceptive  or  non  deceptive. 

Conventional  K-nearest  neighbor  algorithm  and  a  Fuzzy  version  of  it  are  described  in  the 
following  two  sections. 


*  We  are  indebted  to  Professor  R.  Duda  for  suggesting  KNN  classifier. 
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in.4.1.  K-Nearest  Neighbor  Algorithm 


K-Nearest  neighbor  algorithm  is  a  supervised  classification  method.  There  is  no  need  for 
the  training  or  adjusting  the  classifier.  A  set  of  labeled  input  samples  is  given  to  the 
classifier.  When  a  new  sample  is  ^ven  to  the  system,  it  finds  its  K  nearest  neighboring 
samples,  and  assigns  this  sample  to  the  class  that  the  majority  of  the  neighbors  belong  to. 
K  could  be  any  positive  integer.  When  K  is  set  to  1,  the  algorithm  is  called  the  nearest 
neighbor  algorithm.  In  this  case  each  new  sample  is  assigned  to  the  class  of  its  nearest 
neighbor.  If  K  is  greater  than  1,  it  is  possible  that  there  is  no  majority  class.  To  remove 
this  tie,  the  sum  of  the  distances  of  the  new  sample  to  its  neighbors  in  each  class  is 
computed  and  the  sample  is  assigned  to  the  class  that  has  the  minimum  distance.  The 
main  advantage  of  using  this  method  is  that  the  samples  of  each  class  are  not  needed  to 
cluster  in  a  pre  specified  shape.  For  example  for  a  two  class  classification,  the  K-nearest 
neighbor  classifier  can  still  give  very  good  results  if  the  samples  of  each  class  are  clustered 
in  two  distinct  points  in  the  space.  The  algorithm  for  the  K  nearest  neighbor  is  shown  in 
figure  8.  It  is  supposed  that  C  is  the  number  of  classes,  K  is  the  number  of  neighbors  in 
KNN,  X,  is  the  \th  labeled  sample  and  y  is  the  input  to  be  classified. 
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Figure  8.  K  Nearest  Neighbor  Algorithm 


III.4.2.  Fuzzy  K  Nearest  Neighbor  Algorithm 


The  fuzzy  K  nearest  neighbor  algorithm  uses  the  same  idea  of  conventional  K  nearest 
neighbor  algorithm,  that  is  finding  the  K  samples  that  are  closest  to  sample  to  be  classified. 
But  there  is  a  conceptual  difference  in  classification.  When  fuzzy  classification  is  used,  the 
input  is  not  assigned  to  a  single  class.  Instead,  the  degree  of  belon^ngness  of  the  input  to 
each  class  is  determined  by  the  classifier.  By  using  this  method  more  information  is 
obtained  about  the  input.  For  example  if  the  result  of  classification  determines 
membership  of  an  input  to  class  A  is  0.9  and  to  class  B  is  0.1,  it  means  the  input  belongs 
to  class  A  with  a  very  good  possibility.  But  if  the  membership  to  class  A  is  0.55  and  to 
class  B  is  0.45,  it  means  that  we  cannot  be  very  sure  about  the  classification  of  the  input. 

If  the  crisp  classifier  is  used,  in  both  cases  the  input  will  be  assigned  to  class  A  and  no 
further  information  is  obtained. 

Refer  to  [14,  1 5]  for  more  detailed  discussions  about  fuzzy  K  nearest  neighbor  algorithms. 
The  flowchart  for  a  fuzzy  K  nearest  neighbor  classifier  is  drawn  in  figure  9. 

The  first  step  in  the  fuzzy  K  nearest  neighbor  algorithm  is  the  same  as  first  step  in  crisp 
classifier.  In  both  cases  K  nearest  neighbors  of  the  input  are  found.  While  in  crisp 
classifier  the  majority  class  of  the  neighbors  is  assigned  to  the  input,  in  Fuzzy  classifier 
membership  of  the  input  to  each  class  should  be  found.  In  order  to  do  so  the  membership 
vector  of  each  sample  is  combined  to  obtain  the  membership  vector  of  the  input.  If  the 
samples  are  crisply  classified,  membership  vectors  should  be  assigned  to  them.  One 
method  to  do  so  is  to  assign  the  membership  of  1  to  the  class  that  it  belongs  to,  and 
membership  of  0  to  other  classes.  Other  methods  assign  different  memberships  to  the 
samples  according  to  its  distance  from  the  mean  of  the  class,  or  the  distances  fi-om  the 
nearby  samples  of  its  own  class  and  the  other  classes. 


When  the  membership  vectors  of  the  labeled  samples  are  specified,  they  are  combined  to 
find  the  membership  vector  of  the  unknown  class.  This  procedure  should  be  done  in  a 
way  that  samples  that  are  closer  to  the  input  have  more  effect  on  the  resultant  membership 
function,  The  following  formula  uses  the  inverse  distance  to  weigh  the  membership 
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functions,  x  is  the  input  to  be  classified,  is  the  nearest  neighbor  and  is  the 
membership  of  the  j//;  nearest  neighbor  of  the  input  in  class  i.  D(x,y)  is  a  distance  measure 
between  the  vectors  x  and  y  which  could  be  the  Euclidean  distance. 

m  is  a  parameter  that  changes  the  weighing  effect  of  the  distance.  When  ttj  » 1,  all  the 
samples  will  have  the  same  weight.  When  m  approaches  1,  the  nearest  samples  have  much 
more  effect  on  the  membership  value  of  the  input. 
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Figure  9.  Fuzzy  K-Nearest  Neighbor 
Algorithm 
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III.4.3.  Methods  and  Discussion: 


As  mentioned  in  an  earlier  section  the  classifier  was  needed  to  compare  the  effectiveness 
of  single  features  and  to  choose  the  combinations  of  the  features  that  gave  the  best 
classification  results.  Therefore,  the  classifier  was  selected  and  used  before  the  final 
feature  set  was  determined.  The  classifier  might  change  the  results  of  the  classification 
and  finding  the  best  classifier  is  not  a  trivial  task.  For  example  using  the  value  of  10  for  K 
may  change  the  set  of  30  best  features  that  was  found  by  using  K  =  5. 

It  is  not  practical  to  try  all  different  cases  for  different  classifiers  and  different  parameters 
of  classifiers,  so  it  was  decided  to  use  a  classifier  with  fixed  parameters  up  to  the  point 
that  final  set  of  features  were  selected.  The  classifier  as  mentioned  earlier  was  a  Fuzzy  K- 
nearest  neighbor  with  the  following  parameters: 

K  =  5, 

m  =  2, 

Defuzzification  threshold  =  0.5; 

It  should  be  noted  that  in  order  to  save  computation  time  throughout  this  project,  each  set 
of  files  was  randomly  broken  into  a  training  and  a  testing  set.  Each  file  in  the  testing  set 
was  classified  using  the  labeled  files  in  training  set.  Each  experiment  was  repeated  20 
times,  and  the  results  were  averaged.  The  number  of  files  that  were  used  for  training  and 
testing  were  accordingly  75  and  25.  In  the  last  stage  of  experiments  after  the  final  feature 
set  had  been  fixed,  instead  of  randomly  selecting  testing  and  training  files,  one  file  was 
kept  for  testing  each  time  and  the  experiment  was  repeated  1 00  times  changing  the  test 

file. 

After  the  final  feature  set  was  selected  (Refer  to  the  section  on  Feature  Extraction), 
different  values  for  K  were  tried  on  fuzzy  and  crisp  classifier  to  compare  the  two 
classifiers  and  find  the  best  parameters.  In  addition  to  percentage  of  correct  classification 
a  measure  of  performance  was  also  used  which  is  explained  below. 

The  measure  that  is  used  to  compare  the  performance  of  fuzzy  classifier  is  the  root  mean 
square  of  the  distances  between  the  output  of  the  classifier  and  the  correct  class.  The 
correct  ouput  of  the  classifer  should  be  0  for  non-deceptive  cases  and  1  for  the  deceptive 
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ones.  For  example  if  for  a  deceptive  sample  the  classifier  output  is  0.8,  0.2  is  the  distance 
between  the  output  and  the  correct  class.  The  same  measure  is  used  for  the  crisp 
classifier.  In  the  case  of  the  crisp  classifier  the  distance  is  always  0  for  correct 
classification  and  1  for  incorrect  classification. 

For  the  fuzzy  classifier  the  threshold  used  for  defuzzification  was  also  changed  to  find  the 
optimum  value.  Tables  7  and  8  in  Appendix  A  show  the  results.  The  best  classification  on 
the  average  over  three  sets  is  obtained  using  the  fuzzy  classifier  with  K  =  6,  and  threshold 
=  0.6  .  Using  this  values  correct  classification  of  81.6  percent  was  achieved.  The  best 
result  using  the  crisp  classifier  was  80.6  percent  which  was  obtained  using  K=6.  The 
performance  measures  for  the  fiizzy  and  crisp  classifiers  were  accordingly  0.3915  and 
0.4377  which  shows  fuzzy  classifier  has  a  better  performance  in  this  respect. 

One  final  experiment  that  was  done  is  explained  below.  In  a  Polygraph  examination  a  set 
of  questions  is  repeated  one  to  five  times  and  the  decision  is  made  by  considering  the 
responses  to  all  these  charts.  In  this  project  each  chart  was  classified  separately.  As  the 
final  experiment  responses  to  all  the  charts  in  a  Polygraph  examination  were  combined  and 
classified  as  deceptive  or  non-deceptive.  The  way  they  were  combined  was  finding  the 
majority  class  and  assigning  the  case  to  that  class.  In  the  case  that  equal  number  of  files 
classified  as  deceptive  and  non-deceptive,  the  membership  function  of  the  files  was 
averaged  and  the  case  was  classified  according  to  this  value.  The  classification  results  for 
all  the  files  in  sets  1  to  3  are  shown  in  Table  9  in  Appendix  A.  The  number  of  cases  in 
each  set  was  35.  The  number  of  misclassified  cases  in  sets  1  to  3  are  5,  7,  and  3,  which 
correspond  to  correct  classifications  of  85.7,  80.0,  and  91.4  percent. 
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rv.  Conclusion  and  future  work 


The  set  of  four  features  that  showed  best  classification  results  in  this  project  were 
Maximum  of  GSR,  Upper  Respiration  and  Lower  respiration  signals,  and  the  difference 
between  the  Maximum  and  Minimum  of  High  Cardio  signal.  These  are  all  very  simple 
time  domain  features.  The  best  classification  was  obtained  using  the  fuzzy  classifier  with 
K  =  6,  and  threshold  =  0.6  .  Using  this  values  correct  classification  of  81.6  percent  was 
achieved.  By  combining  all  the  files  in  a  Polygraph  examination  85.7  percent  correct 
classification  was  achieved  on  the  average. 

There  are  several  suggestions  for  the  future  work.  First  is  to  repeat  this  work  with  larger 
sets  of  data  files  and  observe  the  generalizability  of  the  feature  sets  obtained  in  this 
research.  A  possible  way  to  improve  the  results  is  to  change  time  fi'ames  used  to  extract 
each  feature  for  every  question.  In  this  way  the  optimum  time  for  obtdning  a  response 
could  be  found.  Another  suggestion  is  to  try  different  methods  for  fuzzification  and 
defuzzification  of  feature  vectors  to  optimize  the  fuzzy  classifier. 
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Appendices 


Appendix  A: 
Tables 


No. 

feature 

Description 

Channel 

Method 

1 

lOmeaB 

mean 

GSR 

1 

2 

lOcurv'e 

curx’e  length 

GSR 

2 

3 

lOmed  dif 

median  of  the  derivative 

GSR 

1 

4 

10ma.\  min 

minimum  subtracted  from  the  maximum 

GSR 

2 

5 

lOmax 

maximum  of  the  signal 

GSR 

1 

6 

lOmdif 

mean  of  derivative 

GSR 

3 

7 

20ciirv’e 

curve  length 

High  Cardio 

1 

8 

High  Cardio 

1 

9 

20max  min 

minimum  subtracted  from  the  maximum 

High  Cardio 

4 

10 

20ma\ 

maximum  of  the  signal 

High  Cardio 

4 

11 

20inin 

minimum  of  the  signal 

High  Cardio 

1 

12 

30med  dif 

median  of  the  derivative 

Loxv  Cardio 

3 

13 

30ma\ 

maximum  of  the  signal 

Low  Cardio 

1 

14 

40mean 

mean 

Derivative  of  Loxv  Cardio 

1 

15 

40max 

maximum  of  the  signal 

Derivative  of  Low  Cardio 

1 

16 

50cun^e 

cuixe  length 

6 

17 

2 

18 

number  of  the  peaks 

M  lAiy J 1  Jail  >11 

5 

19 

50ie 

inhalation  divided  bv  exhalation 

5 

20 

50max  min 

minimum  subtracted  from  the  maximum 

2 

21 

50max 

maximum  of  the  signal 

6 

22 

60max  min 

minimum  subtracted  from  the  maximum 

2 

23 

60max 

maximum 

3 

24 

lOstd 

standard  dexiation 

GSR 

2 

25 

20std 

standard  dexiation 

High  Cardio 

1 

26 

50std 

Standard  de\iation 

6 

27 

20armodl 

auto  regressive  parameter 

High  Cardio 

7 

28 

max  cross  spectral  density- 

High  Cardio,  Loxx'er  Respirator^' 

1 

29 

lOisdl 

frequencj’  of  maximum  integrated  spectral 
difference  of  control-relevant  pair 

GSR 

1* 

30 

20isdl 

area  under  integrated  spectral  difference 

High  Cardio 

3* 

Methods;  l=Difference  of  Averages,  2=Nonnalized  Average,  3=Max-Max,  4=Min-Min, 
5=Max-Min,  6=Min-Max,  7=Max/Min ,  1*= Average  of  relevant-control  pairs,  3*=Max  of  relevant- 
control  pair. 


Table  1.  Selected  Features 
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Percentage  of  correct  classification  for  30  best  combinations  in  set  1 


Percent  correct 

Feature  1 

Feature  2 

74.2000 

8.0000 

18.0000 

74.0000 

10.0000 

21.0000 

73.0000 

5.0000 

7.0000 

72.0000 

24.0000 

26.0000 

71.8000 

23.0000 

24.0000 

71.6000 

4.0000 

26.0000 

70.4000 

25.0000 

26.0000 

70.4000 

18.0000 

25.0000 

70.2000 

24.0000 

27.0000 

70.2000 

9.0000 

21.0000 

70.0000 

5.0000 

27.0000 

69.6000 

11.0000 

21.0000 

69.6000 

9.0000 

24.0000 

69.4000 

11.0000 

27.0000 

69.4000 

5.0000 

26.0000 

69.2000 

8.0000 

19.0000 

69.2000 

5.0000 

18.0000 

69.0000 

25.0000 

27.0000 

69.0000 

9.0000 

18.0000 

69.0000 

5.0000 

23.0000 

68.8000 

24.0000 

30.0000 

68.8000 

18.0000 

20.0000 

68.8000 

17.0000 

20.0000 

68.8000 

4.0000 

15.0000 

68.6000 

22.0000 

24.0000 

68.4000 

6.0000 

24.0000 

68.4000 

1.0000 

27.0000 

68.2000 

15.0000 

24.0000 

68.2000 

9.0000 

26.0000 

68.2000 

5.0000 

19.0000 

Table  [2.1]  Results  of  pairwise  combinations  of  features 
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Percentage  of  correct  classification  for  30  best  combinations  in  set  2 


Percent  correct 

Feature  1 

Feature  2 

74.4000 

5.0000 

23.0000 

74.4000 

4.0000 

27.0000 

74.2000 

4.0000 

15,0000 

74.0000 

20.0000 

24.0000 

73.6000 

16.0000 

24.0000 

73.2000 

3.0000 

27.0000 

72.8000 

27.0000 

30.0000 

72.6000 

4.0000 

30.0000 

72.6000 

4.0000 

7.0000 

72.4000 

5.0000 

25.0000 

72.2000 

24.0000 

30.0000 

72.2000 

8.0000 

27.0000 

72.2000 

4.0000 

17.0000 

72.2000 

4.0000 

16.0000 

72.0000 

24.0000 

27.0000 

72.0000 

24.0000 

25.0000 

72.0000 

4.0000 

20.0000 

71.8000 

7.0000 

23.0000 

71.8000 

4.0000 

10.0000 

71.2000 

25.0000 

27.0000 

70.8000 

24.0000 

26.0000 

70.8000 

8.0000 

22.0000 

70.6000 

7,0000 

27.0000 

70.6000 

6.0000 

27.0000 

70.4000 

14.0000 

21.0000 

70.4000 

14.0000 

20.0000 

70.4000 

4.0000 

8.0000 

70.2000 

4.0000 

24.0000 

70.0000 

22.0000 

27.0000 

70.0000 

17.0000 

24.0000 

Table  [2.2]  Results  of  pairwise  combinations  of  features 
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Percentage  of  correct  classification  for  30  best  combinations  in  set  3 


Percent  correct 

Feature  1 

Feature  2 

81.0000 

1.0000 

10.0000 

80.6000 

9.0000 

24.0000 

80.4000 

10.0000 

24.0000 

80.4000 

4.0000 

25.0000 

80.2000 

4.0000 

9.0000 

79.8000 

5.0000 

11.0000 

79.2000 

17.0000 

24.0000 

79.2000 

1.0000 

21.0000 

79.2000 

1.0000 

8.0000 

79.0000 

1.0000 

24.0000 

79.0000 

1.0000 

11.0000 

78.8000 

4.0000 

11.0000 

78.6000 

4.0000 

17.0000 

78.2000 

24.0000 

25.0000 

78.2000 

1.0000 

14.0000 

78.0000 

1.0000 

23.0000 

78.0000 

1.0000 

20.0000 

77.8000 

23.0000 

24.0000 

77.8000 

1.0000 

5.0000 

77.6000 

19.0000 

24.0000 

77.4000 

11.0000 

24.0000 

77.4000 

5.0000 

18.0000 

77.2000 

4.0000 

19.0000 

77.0000 

4.0000 

18.0000 

76.8000 

4.0000 

15.0000 

76.6000 

5.0000 

13.0000 

76.6000 

4.0000 

24.0000 

76.2000 

4.0000 

5.0000 

76.2000 

1.0000 

26.0000 

Table  [2.3]  Results  of  pairwise  combinations  of  features 
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Percentage  of  correct  classification  for  30  best  combinations  in  average 


Percent  correct 

Feature  1 

Feature  2 

73.2667 

4.0000 

15.0000 

72,8000 

24.0000 

26.0000 

72.6667 

4.0000 

17.0000 

72.6000 

5.0000 

23.0000 

72.2667 

23.0000 

24.0000 

72.0667 

24.0000 

30.0000 

71.9333 

20.0000 

24.0000 

71,8667 

24.0000 

27.0000 

71.4667 

24.0000 

25.0000 

71,4000 

4.0000 

26.0000 

71.0667 

4.0000 

10.0000 

70.9333 

1.0000 

8.0000 

70.9333 

4.0000 

23.0000 

70.6000 

5.0000 

11.0000 

70,6000 

4.0000 

24.0000 

70.5333 

9.0000 

24.0000 

70.4667 

6.0000 

24.0000 

70.4667 

4.0000 

25.0000 

70.4667 

4.0000 

19.0000 

70.4000 

4.0000 

30.0000 

70.3333 

1.0000 

23.0000 

70.0667 

17.0000 

24.0000 

70.0667 

1.0000 

24.0000 

70.0000 

16.0000 

24.0000 

69.9333 

4.0000 

9.0000 

69.8667 

4.0000 

20.0000 

69.8667 

5.0000 

7.0000 

69.8667 

4.0000 

7.0000 

69.8000 

15.0000 

24.0000 

69.8000 

1.0000 

21.0000 

Table  [2.4]  Results  of  pairwise  combinations  of  features 
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Percentage  of  correct  classification  for  30  best  combinations  in  set  1 


Percent  correct 

Feature  1 

Feature  2 

Feature  3 

79.4000 

10.0000 

21.0000 

26.0000 

77.6000 

5.0000 

7.0000 

23.0000 

77.6000 

5.0000 

23.0000 

11.0000 

77.4000 

5.0000 

23.0000 

21.0000 

76.4000 

16.0000 

24.0000 

18.0000 

76.4000 

5.0000 

23.0000 

19.0000 

75.8000 

23.0000 

24.0000 

19.0000 

75.8000 

23.0000 

24.0000 

15.0000 

75.8000 

5.0000 

23.0000 

7.0000 

75.6000 

5.0000 

7.0000 

22.0000 

75.6000 

5.0000 

7.0000 

21.0000 

75.6000 

5.0000 

7.0000 

16.0000 

75.4000 

5.0000 

7.0000 

14.0000 

75.4000 

5.0000 

11.0000 

10.0000 

75.2000 

10.0000 

21.0000 

19.0000 

75.2000 

8.0000 

18.0000 

6.0000 

75.2000 

5.0000 

23.0000 

2.0000 

75.0000 

10.0000 

21.0000  1 

16.0000 

75.0000 

10.0000 

21.0000 

8.0000 

75.0000 

5.0000 

11.0000 

18.0000 

75.0000 

4.0000 

26.0000 

14.0000 

75.0000 

5.0000 

23.0000 

29.0000 

75.0000 

5.0000 

23.0000 

25.0000 

74.8000 

10.0000 

21.0000 

9.0000 

74.6000  i 

10.0000 

21.0000 

12.0000 

74.6000 

5.0000 

11.0000 

23.0000 

74.6000 

10.0000 

24.0000 

9.0000 

74.6000 

5.0000 

23.0000 

10.0000 

74.6000 

5.0000 

23.0000 

9.0000 

74.4000 

5.0000 

7.0000 

19.0000 

Table  [4.1]  Results  of  combinations  of  3  features 
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Percentage  of  correct  classification  for  30  best  combinations  in  set  2 


Percent  correct 

Feature  1 

Feature  2 

Feature  3 

79.8000 

20.0000 

24.0000 

12.0000 

78.6000 

24.0000 

30.0000 

19.0000 

78.6000 

4.0000 

15.0000 

28.0000 

78.0000 

24.0000 

27.0000 

19.0000 

77.8000 

4.0000 

17.0000 

19.0000 

77.6000 

8.0000 

18.0000 

4.0000 

77.4000 

4.0000 

27.0000 

19.0000 

77.4000 

5.0000 

23.0000 

21.0000 

77.2000 

5.0000 

23.0000 

29.0000 

77.2000 

4.0000 

15.0000 

27.0000 

77.0000 

4.0000 

27.0000 

18.0000 

77.0000 

4.0000 

15.0000 

21.0000 

76.6000 

5.0000 

7.0000 

23.0000 

76.6000 

20.0000 

24.0000 

3.0000 

76.4000 

16.0000 

24.0000 

30.0000 

76.4000 

4.0000 

27.0000 

25.0000 

76.4000 

24.0000 

27.0000 

10.0000 

76.4000 

23.0000 

24.0000 

30.0000 

76.2000 

5.0000 

23.0000 

3.0000 

76.2000 

4.0000 

17.0000 

2.0000 

76.2000 

4.0000 

15.0000 

26.0000 

75.8000 

5.0000 

7.0000 

15.0000 

75.8000 

24.0000 

30.0000 

4.0000 

75.8000 

5.0000 

23.0000 

28.0000 

75.6000 

4.0000 

27.0000 

15.0000 

75.6000 

24.0000 

27.0000 

26.0000 

75.6000 

24.0000 

27.0000 

1.0000 

75.6000 

20.0000 

24.0000 

25.0000 

75.6000 

24.0000 

30.0000 

16.0000 

75.4000 

4.0000 

15.0000 

8.0000 

Table  [4.2]  Results  of  combinations  of  3  features 


Appendices 


Appendix  A: 
Tables 


No. 

feature 

Description 

Channel 

Method 

1 

lOmean 

mean 

GSR 

1 

2 

lOcuA’e 

cun'e  length 

GSR 

2 

3 

lOmed  dif 

median  of  the  derivative 

GSR 

1 

4 

lOmax  min 

minimum  subtracted  from  the  maximum 

GSR 

2 

5 

lOmax 

maximum  of  the  signal 

GSR 

1 

6 

lOmdif 

mean  of  derivative 

GSR 

3 

7 

20cur\’e 

curve  length 

High  Cardio 

1 

8 

20ampcard 

High  Cardio 

1 

9 

20max  min 

minimum  subtracted  from  the  maximum 

High  Cardio 

4 

10 

20max 

maximum  of  the  signal 

High  Cardio 

4 

11 

20min 

minimum  of  the  signal 

High  Cardio 

1 

12 

30med_  dif 

median  of  the  derivative 

Low  Cardio 

3 

13 

30max 

maximum  of  the  signal 

Low  Cardio 

1 

14 

40mean 

mean 

Derivative  of  Low  Cardio 

1 

15 

40max 

maximum  of  the  signal 

Derivative  of  Low  Cardio 

1 

16 

50cuA’e 

curve  length 

6 

17 

2 

18 

number  of  the  peaks 

Lower  Respiratory 

5 

19 

50ie 

inhalation  di\ided  by  exhalation 

5 

20 

50max  min 

minimum  subtracted  from  the  maximum 

Lower  Respiratory 

2 

21 

50ma\ 

maximum  of  the  signal 

6 

22 

60max  min 

minimum  subtracted  from  the  maximum 

2 

23 

60ma\ 

maximum 

3 

24 

lOstd 

standard  de\iation 

GSR 

2 

25 

20std 

standard  de\iation 

High  Cardio 

1 

26 

50std 

standard  delation 

6 

27 

20armodl 

auto  regressive  parameter 

High  Cardio 

7 

28 

High  Cardio,  Lower  Respiratory 

1 

29 

lOisdl 

frequencj’  of  maximum  integrated  spectral 
difference  of  control-relevant  pair 

GSR 

1* 

30 

20isdl 

area  under  integrated  spectral  difference 

High  Cardio 

3* 

Methods:  l=Difference  of  Averages,  2=Nonnalized  Average,  3=Max-Max,  4=Mirj-Miii, 
5=Max-Min,  6=Min-Max,  7=Max/Min ,  l*=Average  of  relevant-control  pairs,  3*=Max  of  relevant- 
control  pair. 


Table  1.  Selected  Features 
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Percentage  of  correct  classification  for  30  best  combinations  in  set  1 


Percent  correct 

Feature  1 

Feature  2 

74.2000 

8.0000 

18.0000 

74.0000 

10.0000 

21.0000 

73.0000 

5.0000 

7.0000 

72.0000 

24.0000 

26.0000 

71.8000 

23.0000 

24.0000 

71.6000 

4.0000 

26.0000 

70.4000 

25.0000 

26.0000 

70.4000 

18.0000 

25.0000 

70.2000 

24.0000 

27.0000 

70.2000 

9.0000 

21.0000 

70.0000 

5.0000 

27.0000 

69.6000 

11.0000 

21.0000 

69.6000 

9.0000 

24.0000 

69.4000 

11.0000 

27.0000 

69.4000 

5.0000 

26.0000 

69.2000 

8.0000 

19.0000 

69.2000 

5.0000 

18.0000 

69.0000 

25.0000 

27.0000 

69.0000 

9.0000 

18.0000 

69.0000 

5.0000 

23.0000 

68.8000 

24.0000 

30.0000 

68.8000 

18.0000 

20.0000 

68.8000 

17.0000 

20.0000 

68.8000 

4.0000 

15.0000 

68.6000 

22.0000 

24.0000 

68.4000 

6.0000 

24.0000 

68.4000 

1.0000 

27.0000 

68.2000 

15.0000 

24.0000 

68.2000 

9.0000 

26.0000 

68.2000 

5.0000 

19.0000 

Table  [2.1]  Results  of  pairwise  combinations  of  features 
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Percentage  of  correct  classification  for  30  best  combinations  in  set  2 


Percent  correct 

Feature  1 

Feature  2 

74.4000 

5.0000 

23.0000 

74.4000 

4.0000 

27.0000 

74.2000 

4.0000 

15.0000 

74.0000 

20.0000 

24.0000 

73.6000 

16.0000 

24.0000 

73.2000 

3.0000 

27.0000 

72.8000 

27.0000 

30.0000 

72.6000 

4.0000 

30.0000 

72.6000 

4.0000 

7.0000 

72.4000 

5.0000 

25.0000 

72.2000 

24.0000 

30.0000 

72.2000 

8.0000 

27.0000 

72.2000 

4.0000 

17.0000 

72.2000 

4.0000 

16.0000 

72.0000 

24.0000 

27.0000 

72.0000 

24.0000 

25.0000 

72.0000 

4.0000 

20.0000 

71.8000 

7.0000 

23.0000 

71.8000 

4.0000 

10.0000 

71.2000 

25.0000 

27.0000 

70.8000 

24.0000 

26.0000 

70.8000 

8.0000 

22.0000 

70.6000 

7.0000 

27.0000 

70.6000 

6.0000 

27.0000 

70.4000 

14.0000 

21.0000 

70.4000 

14.0000 

20.0000 

70.4000 

4.0000 

8.0000 

70.2000 

4.0000 

24.0000 

70.0000 

22.0000 

27.0000 

70.0000 

17.0000 

24.0000 

Table  [2.2]  Results  of  pairwise  combinations  of  features 


Percentage  of  correct  classification  for  30  best  combinations  in  set  3 


Percent  correct 

Feature  1 

Feature  2 

81.0000 

1.0000 

10.0000 

80.6000 

9.0000 

24.0000 

80.4000 

10.0000 

24.0000 

80.4000 

4.0000 

25.0000 

80.2000 

4.0000 

9.0000 

79.8000 

5.0000 

11.0000 

79.2000 

17.0000 

24.0000 

79.2000 

1.0000 

21.0000 

79.2000 

1.0000 

8.0000 

79.0000 

1.0000 

24.0000 

79.0000 

1.0000 

11.0000 

78.8000 

4.0000 

11.0000 

78.6000 

4.0000 

17.0000 

78.2000 

24.0000 

25.0000 

78.2000 

1.0000 

14.0000 

78.0000 

1.0000 

23.0000 

78.0000 

1.0000 

20.0000 

77.8000 

23.0000 

24.0000 

77.8000 

1.0000 

5.0000 

77.6000 

19.0000 

24.0000 

77.4000 

11.0000 

24.0000 

77.4000 

5.0000 

18.0000 

77.2000 

4.0000 

19.0000 

77.0000 

4.0000 

18.0000 

76.8000 

4.0000 

15.0000 

76.6000 

5.0000 

13.0000 

76.6000 

4.0000 

24.0000 

76.2000 

4.0000 

5.0000 

76.2000 

1.0000 

26.0000 

Table  [2.3]  Results  of  pairwise  combinations  of  features 
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Percentage  of  correct  classification  for  30  best  combinations  in  average 


Percent  correct 

Feature  1 

Feature  2 

73.2667 

4.0000 

15.0000 

72.8000 

24.0000 

26.0000 

72.6667 

4.0000 

17.0000 

72.6000 

5.0000 

23.0000 

72.2667 

23.0000 

24.0000 

72.0667 

24.0000 

30.0000 

71.9333 

20.0000 

24.0000 

71.8667 

24.0000 

27.0000 

71.4667 

24.0000 

25.0000 

71.4000 

4.0000 

26.0000 

71.0667 

4.0000 

10.0000 

70.9333 

1.0000 

8.0000 

70.9333 

4.0000 

23.0000 

70.6000 

5.0000 

11.0000 

70.6000 

4.0000 

24.0000 

70.5333 

9.0000 

24.0000 

70.4667 

6.0000 

24.0000 

70.4667 

4.0000  1 

25.0000 

70.4667 

4.0000 

19.0000 

70.4000 

4.0000 

30.0000 

70.3333 

1.0000 

23.0000 

70.0667 

17.0000 

24.0000 

70.0667 

1.0000 

24.0000 

70.0000 

16.0000 

24.0000 

69.9333 

4.0000 

9.0000 

69.8667 

4.0000 

20.0000 

69.8667 

5.0000 

7.0000 

69.8667 

4.0000 

7.0000 

69.8000 

15.0000 

24.0000 

69.8000 

1.0000 

21.0000 

Table  [2.4]  Results  of  pairwise  combinations  of  features 
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Percentage  of  correct  ci&asification  for  30  best  combinations  in  sei  i 


Percent  correct 

Feature  1 

Feature  2 

Feature  3 

79.4000 

10.0000 

21.0000 

26.0000 

77.6000 

5.0000 

7.0000 

23.0000 

77.6000 

5.0000 

23.0000 

11.0000 

77.4000 

5.0000 

23.0000 

21.0000 

76.4000 

16.0000 

24.0000 

18.0000 

76.4000 

5.0000 

23.0000 

19.0000 

75.8000 

23.0000 

24.0000 

19.0000 

75.8000 

23.0000 

24.0000 

15.0000 

75.8000 

5.0000 

23.0000 

7.0000 

75.6000 

5.0000 

7.0000 

22.0000 

75.6000 

5.0000 

7.0000 

21.0000 

75.6000 

5.0000 

7.0000 

16.0000 

75.4000 

5.0000 

7.0000 

14.0000 

75.4000 

5.0000 

11.0000 

10.0000 

75.2000 

10.0000 

21.0000 

19.0000 

75.2000 

8.0000  i 

18.0000 

6.0000 

75.2000 

5.0000 

23.0000 

2.0000 

75.0000 

10.0000 

21.0000 

16.0000 

75.0000 

10.0000 

21.0000 

8.0000 

75.0000 

5.0000 

11.0000 

18.0000 

75.0000  1 

4.0000 

26.0000 

14.0000 

75.0000 

5.0000 

23.0000 

29.0000 

75.0000 

5.0000 

23.0000 

25.0000 

74.8000 

10.0000 

21.0000 

9.0000 

74.6000 

10.0000 

21.0000 

12.0000 

74.6000 

5.0000 

11.0000 

23.0000 

74.6000 

10.0000 

24.0000 

9.0000 

74.6000 

5.0000 

23.0000 

10.0000 

74.6000 

5.0000 

23.0000 

9.0000 

74.4000 

5.0000 

7.0000 

19.0000 

Table  [4.1]  Results  of  combinations  of  3  features 


Percentage  of  correct  classification  for  30  best  combinations  in  set  2 


Percent  correct 

Feature  1 

Feature  2 

Feature  3 

79.8000 

20.0000 

24.0000 

12.0000 

78.6000 

24.0000 

30.0000 

19.0000 

78.6000 

4.0000 

15.0000 

28.0000 

78.0000 

24.0000 

27.0000 

19.0000 

77.8000 

4.0000 

17.0000 

19.0000 

77.6000 

8.0000 

18.0000 

4.0000 

77.4000 

4.0000 

27.0000 

19.0000 

77.4000 

5.0000 

23.0000 

21.0000 

77.2000 

5.0000 

23.0000 

29.0000 

77.2000 

4.0000 

15.0000 

27.0000 

77.0000 

4.0000 

27.0000 

18.0000 

77.0000 

4.0000 

15.0000 

21.0000 

76.6000 

5.0000 

7.0000 

23.0000 

76.6000 

20.0000 

24.0000 

3.0000 

76.4000 

16.0000 

24.0000 

30.0000 

76.4000 

4.0000 

27.0000 

25.0000 

76.4000 

24.0000 

27,0000 

10.0000 

76.4000 

23.0000 

24.0000 

30.0000 

76.2000 

5.0000 

23.0000 

3.0000 

76.2000 

4.0000  n 

17.0000 

2.0000 

76.2000 

4.0000 

15.0000 

26.0000 

75.8000 

5.0000 

7.0000 

15.0000 

75.8000 

24.0000 

30.0000 

4.0000 

75.8000 

5.0000 

23.0000 

28.0000 

75.6000 

4.0000 

27.0000 

15.0000 

75.6000 

24.0000 

27.0000 

26.0000 

75.6000 

24.0000 

27.0000 

1.0000 

75.6000 

20.0000 

24.0000 

25.0000 

75.6000 

24.0000 

30.0000 

16.0000 

75.4000 

4.0000 

15.0000 

8.0000 

Table  [4.2]  Results  of  combinations  of  3  features 


Percentage  of  correct  classification  for  30  best  combinations  in  set  3 


Percent  correct 

Feature  1 

Feature  2 

Feature  3 

85.2000 

9.0000 

24.0000 

19.0000 

85.0000 

9.0000 

24.0000 

22.0000 

84.2000 

16.0000 

24.0000 

19.0000 

84.0000 

17,0000 

24.0000 

9.0000 

84.0000 

4.0000 

26.0000 

17.0000 

83.6000 

4.0000 

26.0000 

11.0000 

83.6000 

4.0000 

17.0000 

9.0000 

83.6000 

24.0000 

26.0000 

17.0000 

83.6000 

4.0000 

15.0000 

9.0000 

83.4000 

5.0000 

11.0000 

24.0000 

83.4000 

9.0000 

24.0000 

21.0000 

83.4000 

9.0000 

24.0000 

17.0000 

83.4000 

9.0000 

24.0000 

14.0000 

83.4000 

4.0000 

26.0000 

9.0000 

83.2000 

16.0000 

24.0000 

1.0000 

83.2000 

4.0000 

17.0000 

26.0000 

83.2000 

24.0000 

26.0000  1 

9.0000 

83.0000 

9.0000 

24.0000 

12.0000 

83.0000 

9.0000 

24.0000 

6.0000 

83.0000 

4.0000 

17,0000 

11.0000 

82.8000 

9.0000 

24.0000 

18.0000 

82.8000 

23.0000 

24.0000 

1.0000 

82.8000 

4.0000 

17.0000 

24.0000 

82.8000 

4.0000 

17.0000 

8.0000 

82.6000 

17.0000 

24.0000 

19.0000 

82.4000 

17.0000 

24.0000 

8.0000 

82.4000 

9.0000 

24.0000 

2.0000 

82.4000 

5.0000 

23.0000 

29.0000 

82.2000 

5.0000 

23,0000 

10.0000 

82.0000 

9.0000 

24.0000 

26.0000 

Table  [4.3]  Results  of  combinations  of  3  features 


Percentage  of  correct  classification  for  30  best  combinations  on  average 


Percent  correct 

Feature  1 

Feature  2 

Feature  3 

78.2000 

5.0000 

23.0000 

29.0000 

77.6000 

5.0000 

7.0000 

23.0000 

77.3333 

5.0000 

23.0000 

21.0000 

76.6000 

5.0000 

23.0000 

10.0000 

76.0000 

23.0000 

24.0000 

15.0000 

75.8667 

5.0000 

7.0000 

21.0000 

75.8667 

5.0000 

23.0000 

7.0000 

75.6667 

5.0000 

23.0000 

11.0000 

75.6000 

8.0000 

18.0000 

4.0000 

75.5333 

4.0000 

17.0000 

19.0000 

75.5333 

5.0000 

11.0000 

17.0000 

75.5333 

24.0000 

26.0000 

14.0000 

75.4667  1 

5.0000 

23.0000 

28.0000 

75.4667 

4.0000 

15.0000 

26.0000 

75.3333 

17.0000 

24.0000 

19.0000 

75.3333 

5.0000 

23.0000 

25.0000 

75.2000 

5.0000 

7.0000 

17.0000 

75.2000 

4.0000 

15.0000 

23.0000 

75.0000 

5.0000  1 

23.0000 

17.0000 

74.9333 

5.0000 

23.0000 

3.0000 

74.8667 

4.0000 

26.0000 

15.0000 

74.8000 

23.0000 

24.0000 

19.0000 

74.8000 

5.0000 

23.0000 

14.0000 

74.8000 

5.0000 

23.0000 

1.0000 

74.8000 

24.0000 

26.0000 

25.0000 

74.7333 

24.0000 

30.0000 

19.0000 

74.7333 

5.0000 

23.0000 

19.0000 

;  74.7333 

5.0000 

23.0000 

9.0000 

74.6667 

5.0000 

7.0000 

22.0000 

74.6667 

4.0000 

26.0000 

19.0000 

Table  [4.4]  Results  of  combinations  of  3  features 


Percentage  of  correct  classification  for  30  best  combinations  in  set  1 


Percent  correct 

Feature  1 

Feature  2 

Feature  3 

Feature  4 

81.0000 

5.0000 

21.0000 

23.0000 

9.0000 

80.6000 

5.0000 

7.0000 

23.0000 

6.0000 

80.2000 

5.0000 

21.0000 

23.0000 

11.0000 

79.6000 

5.0000 

21.0000 

23.0000 

10.0000 

79.4000 

5.0000 

7.0000 

23.0000 

12.0000 

79.4000 

5.0000 

10.0000 

23.0000 

21.0000 

79.0000 

5.0000 

7.0000 

23.0000 

28.0000 

79.0000 

5.0000 

7.0000 

23.0000 

19.0000 

79.0000 

5.0000 

21.0000 

23.0000 

26.0000 

78.8000 

5,0000 

11.0000 

23.0000 

7.0000 

78.6000 

5.0000 

21.0000 

23.0000 

12.0000 

78.4000 

5.0000 

21.0000 

23.0000 

15.0000 

78.4000 

5.0000 

10.0000 

23.0000 

8.0000 

78.0000 

5.0000 

11.0000 

23.0000 

21.0000 

78.0000 

5.0000 

7.0000 

23.0000 

20.0000 

78.0000 

5.0000 

7.0000 

23.0000 

14.0000 

77.8000 

5.0000 

7.0000 

23.0000 

2.0000 

77.8000 

5.0000 

21.0000 

23.0000 

28.0000 

77.8000 

5.0000 

21.0000  n 

23.0000 

6.0000 

77.8000 

5.0000 

21.0000 

23.0000 

3.0000 

77.8000 

5.0000 

23.0000 

29.0000 

26.0000 

77.8000 

5.0000 

23.0000 

29.0000 

22.0000 

77.6000 

10.0000 

21.0000 

26.0000 

2.0000 

77.6000 

5.0000 

7.0000 

23.0000 

22.0000 

77.6000 

5.0000 

10.0000 

23.0000 

19.0000 

77.6000 

5.0000 

23.0000 

29.0000 

19.0000 

77.6000 

5.0000 

23.0000 

29.0000 

1.0000 

77.4000 

10.0000 

21.0000 

26.0000 

9.0000 

77.4000 

5.0000 

11,0000 

23.0000 

10.0000 

77.4000 

5.0000 

11.0000 

23.0000 

8.0000 

Table  [6.1]  Results  of  combinations  of  4  features 


Percentage  of  correct  classification  for  30  best  combinations  in  set  2 


Percent  correct 

Feature  1 

Feature  2 

Feature  3 

Feature  4 

81.0000 

5.0000 

23.0000 

29.0000 

14.0000 

79.8000 

5.0000 

10.0000 

23.0000 

21.0000 

79.6000 

5.0000 

21.0000 

23.0000 

11.0000 

79.4000 

14.0000 

24.0000 

26.0000 

19.0000 

79.4000 

5.0000 

21.0000 

23.0000 

9.0000 

79.2000 

5.0000 

21.0000 

23.0000 

13.0000 

79.0000 

5.0000 

11.0000 

23.0000 

3.0000 

79.0000 

5.0000 

23.0000 

29.0000 

21.0000 

78.8000 

5.0000 

23.0000 

29.0000 

6.0000 

78.6000 

4.0000 

19.0000 

17.0000 

25.0000 

78.6000 

5.0000 

21.0000 

23.0000 

10.0000 

78.4000 

4.0000 

19.0000 

17.0000 

6.0000 

78.4000 

5.0000 

23.0000 

29.0000 

19.0000 

78.2000 

5.0000 

11.0000 

23.0000 

25.0000 

78.2000 

5.0000 

11.0000 

23.0000 

6.0000 

78.2000 

4.0000 

15.0000 

28.0000 

27.0000 

78.2000 

5.0000 

7.0000 

23.0000 

11.0000 

78.2000 

19.0000 

24.0000 

30.0000 

11.0000 

78.0000 

5.0000 

21.0000 

23.0000 

27.0000 

77.8000 

19.0000 

24.0000 

30.0000 

23.0000 

77.8000 

19.0000 

24.0000 

30.0000 

16.0000 

77.8000 

5.0000 

10.0000 

23.0000 

11.0000 

77.6000 

4.0000 

19.0000 

17.0000 

3.0000 

77.6000 

5.0000 

7.0000 

23.0000 

28.0000 

77.4000 

14.0000 

24.0000 

26.0000 

20.0000 

77.4000 

5.0000 

21.0000 

23.0000 

30.0000 

77.2000 

5.0000 

11.0000 

23.0000 

8.0000 

77.2000 

4.0000 

19.0000 

17.0000 

11.0000 

77.2000 

5.0000 

7.0000 

23.0000 

26.0000 

77.2000 

5.0000 

21.0000 

23.0000 

12.0000 

Table  [6.2]  Results  of  combinations  of  4  features 


5 


Percentage  of  correct  classification  for  30  best  combinations  in  set  3 


Percent  correct 

Feature  1 

Feature  2 

Feature  3 

Feature  4 

87.4000 

9.0000 

19.0000 

24.0000 

14.0000 

87.2000 

9.0000 

14.0000 

24.0000 

19.0000 

87.0000 

9.0000 

19.0000 

24.0000 

11.0000 

86.8000 

9.0000 

19.0000 

24.0000 

18.0000 

86.6000 

5.0000 

21.0000 

23.0000 

29.0000 

86.6000 

9.0000 

19.0000 

24.0000 

16.0000 

86.4000 

9.0000 

19.0000 

24.0000 

21.0000 

86.4000 

4.0000 

17.0000 

26.0000 

18.0000 

86.2000 

4.0000 

11.0000 

26.0000 

24.0000 

86.2000 

4.0000 

8.0000 

18.0000 

9.0000 

86.2000 

9.0000 

19.0000 

24.0000 

22.0000 

86.2000 

9.0000 

19.0000 

24.0000 

6.0000 

86.0000 

9.0000 

19.0000 

24.0000 

12.0000 

86.0000 

9.0000 

19.0000 

24.0000 

10.0000 

85.8000 

9.0000 

19.0000 

24.0000 

26.0000 

85.8000 

4.0000 

17.0000 

26.0000 

9.0000 

85.6000 

5.0000 

7.0000 

21.0000 

16.0000 

85.6000 

5.0000 

7.0000 

21.0000 

8.0000 

85.6000 

9.0000 

19.0000 

24.0000  1 

8.0000 

85.6000 

9.0000 

19.0000 

24.0000 

5.0000 

85.6000 

9.0000 

19.0000 

24.0000 

1.0000 

85.4000 

9.0000 

14.0000 

24.0000 

4.0000 

85.4000 

5.0000 

21.0000 

23.0000 

1.0000 

85.2000 

4.0000 

19.0000 

17.0000 

10.0000 

85.2000 

9.0000 

19.0000 

24.0000 

4.0000 

85.0000 

5.0000 

11.0000 

17.0000 

4.0000 

85.0000 

9.0000 

19.0000 

24.0000 

2.0000 

85.0000 

4.0000 

17.0000 

26.0000 

8.0000 

84.8000 

4.0000 

11.0000 

26.0000 

9.0000 

84.8000 

5.0000 

21.0000 

23.0000 

22.0000 

Table  [6.3]  Results  of  combinations  of  4  features 


6 


Percentage  of  correct  classification  for  30  best  combinations  on  average 


Percent  correct 

Feature  1 

Feature  2 

Feature  3 

Feature  4 

81.0667 

5.0000 

21.0000 

23.0000 

9.0000 

79.9333 

5.0000 

23.0000 

29.0000 

21.0000 

79.8667 

5.0000 

21.0000 

23.0000 

11.0000 

79.6000 

5.0000 

10.0000 

23.0000 

21.0000 

79.2667 

5.0000 

23.0000 

29.0000 

19.0000 

79.1333 

5.0000 

21.0000 

23.0000 

10.0000 

79.0667 

5.0000 

23.0000 

29.0000 

14.0000 

79.0000 

14.0000 

24.0000 

26.0000 

19.0000 

78.9333 

5.0000 

7.0000 

23.0000 

12.0000 

78.8667 

5.0000 

21.0000 

23.0000 

22.0000 

78.8667 

5.0000 

7.0000 

23.0000 

28.0000 

78.7333 

5.0000 

7.0000 

23.0000 

6.0000 

78.6667 

5.0000 

21.0000 

23.0000 

7.0000 

78.5333 

5.0000 

21.0000 

23.0000 

1.0000 

78.4667 

5.0000 

23.0000 

29.0000 

1.0000 

78.4000 

5.0000  n 

7.0000 

21.0000 

8.0000 

78.4000 

5.0000 

7.0000 

23.0000 

26.0000 

78.2667 

5.0000 

7.0000 

23.0000 

11.0000 

78.2000 

5.0000 

7.0000 

23.0000 

22.0000 

78.2000 

5.0000 

23.0000 

29.0000 

28.0000 

78.1333 

5.0000 

11.0000 

23.0000 

10.0000 

78.1333 

5.0000 

10.0000 

23.0000 

25.0000 

78.0667 

5.0000 

7.0000 

23.0000 

16.0000 

78.0000 

5.0000 

7.0000 

23.0000 

20.0000 

77.8667 

5.0000 

10.0000 

23.0000 

29.0000 

Table  [6.4]  Results  of  combinations  of  4  features 


k 

Correct 

classification 

Performance 

Index 

1 

73 

0.5196 

2 

74 

0.5099 

3 

77 

0.4796 

4 

77 

0.4796 

5 

82 

0.42 

6 

81 

0.4359 

7 

76 

0.4899 

8 

80 

0.4472 

9 

79 

0.4583 

10 

79 

0.4583 

Table[7.1]  Classification  results  with  changing  K  for  the  crisp  classifier  for  set  1 


k 

Correct 

classification 

Performance 

Index 

1 

74 

0.5099 

2 

74 

0.5099 

3 

77 

0.4796 

4 

77 

0.4796 

5 

74 

0.5099 

6 

76 

0.4899 

7 

76 

0.4899 

8 

75 

0.5000 

9 

78 

0.4690 

10 

78 

0.4690 

Table[7.2]  Classification  results  with  changing  K  for  the  crisp  classifier  for  set  2 


k 

Correct 

classification 

Performance  Index 

1 

79 

0.4583 

2 

79 

0.4583 

3 

81 

0.4359 

4 

84 

0.4000 

5 

83 

0.4123 

6 

85 

0.3873 

7 

81 

0.4359 

8 

81 

0.4359 

9 

82 

0.4243 

10 

82 

0.4243 

Table[7.3]  Classification  results  with  changing  K  for  the  crisp  classifier  for  set  3 


k 

Correct 

classification 

Performance 

Index 

1 

75.3333 

0.4959 

2 

75.6667 

0.4927 

3 

78.3333 

0.4650 

4 

79.3333 

0.4531 

5 

79.6667 

0.4474 

6 

80.6667 

0.4377 

7 

77.6667 

0.4719 

8 

78.6667 

0.4610 

9 

79.6667 

0.4505 

10 

79.6667 

0.4505 

Table[7.41  Average  classification  results  with  changing  K  for  the  crisp  classifier 
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percent  classification 


performanc 
e  index 


73 

73 

73 

73 

73 

73 

0.5196 

77 

75 

73 

74 

72 

73 

0.4267 

75 

74 

77 

75 

73 

69 

0.4261 

75 

74 

76 

77 

76 

69 

0.4157 

74 

74 

81 

79 

76 

73 

0.4061 

69 

74 

78 

79 

76 

74 

0.3993 

70 

74 

77 

81 

77 

72 

0.3980 

70 

75 

79 

79 

79 

72 

0.3977 

69 

72 

78 

80 

79 

71 

0.3971 

68 

73 

78 

79 

79 

70 

0.3978 

TablelS.l]  Classification  results  for  the  fuzzy  classifier  for  set  1 


percent  classification 


k  \  Threshold  |  0.3 


1 


2 


3 


0.3 

0.4 

0.5 

0.6 

0.7 

0.8 

74 

74 

74 

74 

74 

74 

72 

75 

74 

77 

78 

77 

73 

75 

79 

79 

77 

73 

73 

75 

79 

76 

76 

72 

71 

76 

76 

78 

77 

74 

72 

73 

76 

79 

75 

72 

71 

73  1 

79 

79 

77 

70 

69 

74 

78  1 

80 

77 

70 

73 

75 

80 

79 

77 

70 

72 

73 

81 

79 

76 

72 

performance 


0.5099 


0.4328 


0.4316 


0.4262 


0.4176 


0.4164 


0.4092 


0.4099 


0.4059 


0.4004 


Table[8.2]  Classification  results  for  the  fuzzy  classifier  for  set  2 


I 


percent  classification 

performance 

index 

k  \  Threshold 

0.3 

0.4 

0.5 

0.6 

0.7 

0.8 

1 

79 

79 

79 

79 

79 

79 

0.4583 

2 

73 

76 

79 

84 

84 

84 

0.3991 

3 

72 

75 

81 

85 

85 

82 

0.3862 

4 

75 

78 

84 

86 

86 

83 

0.3704 

5 

74 

80 

83 

86 

86 

84 

0.3635 

6 

75 

82 

OO 

87 

85 

83 

0.3588 

7 

74 

80 

82 

84 

84 

82 

0.3605 

8 

73  1 

78 

83 

84 

84 

81 

0.3638 

9 

73 

79 

83 

84 

85 

81 

0.3625 

10 

73 

80 

83 

84 

85 

82 

0.3615 

TableI8.3]  Classification  results  for  the  fuzzy  classifier  for  set  3 


percent  classification 

performanc 
e  index 

k  \  Threshold 

0.3 

0.4 

0.5 

0.6 

0.7 

0.8 

1 

75.33 

75.33 

75.33 

75.33 

75.33 

75.33 

0.4959 

2 

74 

75.33 

75.33 

78.33 

78 

78 

0.4195 

3 

73.33 

74.67 

79 

79.67 

78.33 

74.67 

0.4146 

4 

74.33 

75.67 

79.67 

79.67 

79.33 

74.67 

0.4041 

5 

73 

76.67 

80 

81 

79.67 

77 

0.3957 

6 

72 

76.33 

79.67 

81.67 

78.67 

76.33 

0.3915 

7 

71.67 

75.67 

79.33 

81.33 

79.33 

74.67 

0.3892 

8 

70.67 

75.67 

80 

81 

80 

74.33 

0.3905 

9 

71.67 

75.33 

80.33 

81 

80.33 

74 

0.3885 

10 

71 

75.33 

80.67 

80.67 

80 

74.67 

0.3866 

TableI8.3]  Average  classification  results  with  for  the  fuzzy  classifier 
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File  Membership  Defuzzified 

Result 

34.0000  0.5970  0 

35.0000  0  0 

36.0000  0.1193  0 

0 

37.0000  0.3174  0 

38.0000  0.8117  1.0000 

39.0000  0.0997  0 

0 

40.0000  0.1889  0 

41.0000  0.4215  0 

42.0000  0.1635  0 

0 

43.0000  0.6474  1.0000 

44.0000  0  0  _ 

45.0000  0.5495  0 

0 

46.0000  0.1115  0 

0 

47.0000  0  0 

48.0000  0.3986  0 

49.0000  0  0 

50.0000  0  0 

0 

51.0000  0.6709  1.0000 

52.0000  1.0000  1.0000 

53.0000  0.5297  0 

1 

54.0000  0.7245  1.0000 

55.0000  0.9200  1.0000 

56.0000  1.0000  1.0000 

1 

57.0000  0.9105  1.0000 

58.0000  0.9398  1.0000 

59.0000  0.5657  0 

1 

60.0000  0.8968  1.0000 

61.0000  1.0000  1.0000 

62.0000  0.2793  0 

63.0000  0.1088  0 

0  Misclassified 

64.0000  0.6245  1.0000 

65.0000  0.8643  1.0000 

66.0000  0.5054  0 

1 

Table  19.1]  Continued 


File 

Membership 

Defuzzified 

Result 

67.0000 

0.8498 

1.0000 

68.0000 

0.6969 

1.0000 

69.0000 

0.8397 

1.0000 

1 

r- -  f 

70.0000 

0.2901 

0 

71.0000 

0.8291 

1.0000 

72.0000 

0.3982 

0 

0  Mi^lassified 

-  1 

73.0000 

1.0000 

1.0000 

74.0000 

0.2463 

0 

75.0000 

0.8043 

1.0000 

1 

-  J 

76.0000 

0.6676 

1.0000 

77.0000 

1.0000 

1.0000 

78.0000 

1.0000 

1.0000 

1 

-  J 

79.0000 

1.0000 

1.0000 

80.0000 

0.7538 

1.0000 

81.0000 

1.0000 

1.0000 

1 

-  □ 

82.0000 

1.0000 

1.0000 

83.0000 

0.8378 

1.0000 

84.0000 

1.0000 

1.0000 

1 

-  1 

85.0000 

0.8926 

1.0000 

86.0000 

0.5448 

0 

87.0000 

0.5751 

0 

0  Misclassifled 

— 

88.0000 

0.8273 

1.0000 

89.0000 

0.2945 

0 

90.0000 

0.9110 

1.0000 

1 

r— - 

91.0000 

1.0000 

1.0000 

92.0000 

1.0000 

1.0000 

93.0000 

0 

0 

1 

94.0000 

0.2887 

0 

95.0000 

0.2079 

0 

96.0000 

0.5793 

0 

0  Misclassifled 

— 

97.0000 

1.0000 

1.0000 

98.0000 

0.7971 

1.0000 

99.0000 

0.8708 

1.0000 

1 

— 

1  100.0000 

1.0000 

1.0000 

1 

Table  [9.11  Continued 


File 

Membership 

Defuzzified 

Result 

1.0000 

0.2579 

0 

2.0000 

0.1307 

0 

3.0000 

0 

0 

0 

4.0000 

0.2652 

0 

5.0000 

0.4345 

0 

6.0000 

0.1175 

0 

0 

7.0000 

1.0000 

1.0000 

8.0000 

0.7086 

1.0000 

1  Misclassified 

9.0000 

0.2856 

0 

10.0000 

0.2745 

0 

11.0000 

0.3056 

0 

0 

12.0000 

0.2720 

0 

13.0000 

0.5019 

0 

14.0000 

0.8871 

1.0000 

0 

15.0000 

0.0912 

0 

16.0000 

0 

0 

17.0000 

0 

0 

0 

18.0000 

0.8334 

1.0000 

1  Misclassified 

19.0000 

0 

0  n 

20.0000 

0 

0 

21.0000 

0.5483 

0 

0 

22.0000 

0 

0 

23.0000 

0 

0 

24.0000 

0.1535 

0 

0 

25.0000 

0.4955 

0 

26.0000 

0.1013 

0 

27.0000 

1.0000 

1.0000 

0 

28.0000 

0.3788 

0 

29.0000 

0.1638 

0 

30.0000 

0.0905 

0 

0 

31.0000 

0 

0 

32.0000 

0.1431 

0 

33.0000 

0.0937 

0 

0 

Table  [9.2]  Classification  of  the  files  of  set  2 


File 

Membership 

Defuzzifled 

Result 

34.0000 

0 

0 

35.0000 

0 

0 

36.0000 

0.1281 

0 

0 

37.0000 

0.3690 

0 

38.0000 

0.5734 

0 

39,0000 

0.1569 

0 

0 

40.0000 

0.3659 

0 

41.0000 

0.4124 

0 

42.0000 

0.1704 

0 

0 

43.0000 

0.4251 

0 

44.0000 

0.0664 

0 

45.0000 

0.5356 

0 

0 

46.0000 

0.5084 

0 

0 

47.0000 

0.1735 

0 

48.0000 

0.7512 

1,0000 

49.0000 

0.5115 

0 

50.0000 

0.0976 

0 

0 

i 

51.0000 

0.6361 

1.0000 

52.0000 

0.8482 

1.0000 

1 

53.0000 

0-3471 

0 

54.0000 

0.8822 

1.0000 

55.0000 

1.0000 

1.0000 

1 

56.0000 

1.0000 

1.0000 

57.0000 

1,0000 

1,0000 

58.0000 

0.8730 

1,0000 

1 

59.0000 

0 

0 

60.0000 

0.0389 

0 

61.0000 

0.3643 

0 

0  Misclassified 

62.0000 

1.0000 

1.0000 

63.0000 

0.8174 

1.0000 

64.0000 

0,8875 

1.0000 

1 

65.0000 

0.7995 

1.0000 

66.0000 

0,5919 

0 

67.0000 

0.7533 

1.0000 

1 

Table  [9.2]  Continued 


File  Membership  Defuzzified 

Result 

68.0000  0.7337  1.0000 

69.0000  0.8524  1.0000 

70.0000  0.8602  1.0000 

1 

71.0000  0.2217  0 

72.0000  1.0000  1.0000  _  . 

73.0000  0.1268  0 

0  Misclassifled 

74.0000  0.8860  1.0000 

— 

75.0000  0.2121  0 

76.0000  0.1684  0 

77.0000  0.6903  1.0000 

0  Misclassifled 

78.0000  0.7680  1.0000 

79.0000  0.8735  1.0000 

- ___ 

80.0000  0.8013  1.0000 

1 

81.0000  0.1748  0 

82.0000  0.5428  0 

83.0000  0.8496  1.0000 

0  Misclassifled 

84.0000  0.3444  0 

85.0000  0.8298  1.0000 

86.0000  0.8590  1.0000 

1 

87.0000  0.6879  1.0000 

88.0000  0.9082  1.0000 

89.0000  0.6653  1.0000 

1 

90.0000  0.1636  0 

91.0000  0.8754  1.0000 

92.0000  0.8594  1.0000 

1 

93.0000  0.5185  0 

94.0000  0.4932  0 

95.0000  0.7802  1.0000 

0  Misclassifled 

96.0000  0.8684  1.0000 

97.0000  0.8788  1.0000 

98.0000  1.0000  1.0000 

1 

99.0000  1.0000  1.0000 

100.0000  0.8669  1.0000 

1 

Table  [9.2]  Continued 


File 

Membership 

Defuzzifled 

Result 

34.0000 

0.3917 

0 

35.0000 

0 

0 

36.0000 

0 

0 

0 

37.0000 

0.1689 

0 

38.0000 

0.5220 

0 

39.0000 

0 

0 

0 

40.0000 

0.0969 

0 

41.0000 

0 

0 

42.0000 

0 

0 

0 

43.0000 

0.4810 

0 

44.0000 

0.3154 

0 

45.0000 

0.4552 

0 

0 

46.0000 

0.3285 

0 

0 

47.0000 

0.3690 

0 

48.0000 

0,5593 

0 

49.0000 

0.3522 

0 

50.0000 

0.2325 

0 

0 

51.0000 

1.0000 

1.0000 

52.0000 

0.9052 

1.0000 

53.0000 

0.8115 

1.0000 

1 

54.0000 

0.8397 

1.0000 

55.0000 

0.8754 

1.0000  i 

56.0000 

0.0930 

0 

j 

1 

57.0000 

0.8330 

1.0000 

58.0000 

1.0000 

1.0000 

1 

59.0000 

1,0000 

1,0000 

60.0000 

1.0000 

1,0000 

61.0000 

1.0000 

1.0000 

1 

62.0000 

1.0000 

1.0000 

63.0000 

0.6496 

1.0000 

64,0000 

0.5075 

0 

1 

65.0000 

0.0823 

0 

66.0000 

0.7810 

1.0000 

67.0000 

0.2356 

0 

0  Misclassifled 

Table  [9.3]  Continued 


File 

Membership 

Defuzzified 

Result 

68.0000 

1.0000 

1.0000 

69.0000 

1.0000 

1.0000 

70.0000 

1.0000 

1.0000 

1 

71.0000 

1.0000 

1.0000 

72.0000 

1.0000 

1.0000 

73.0000 

1.0000 

1.0000 

1 

74.0000 

1.0000 

1.0000 

75.0000 

1.0000 

1.0000 

76.0000 

1.0000 

1.0000 

1 

77.0000 

1.0000 

1.0000 

78.0000 

1.0000 

1.0000 

79.0000 

1.0000 

1.0000 

1 

80.0000 

0.6068 

1.0000 

81.0000 

0.9054 

1.0000 

82.0000 

0.4134 

0 

1 

83.0000 

1.0000 

1.0000 

84.0000 

0 

0 

85.0000 

0.2914 

0 

0  Misclassified 

86.0000 

1.0000 

1.0000 

87.0000 

1.0000 

1.0000 

88.0000 

0.8786 

1.0000 

1 

89.0000 

0.9018 

1.0000 

90.0000 

1.0000 

1.0000 

91.0000 

1.0000 

1.0000 

1 

92.0000 

1.0000 

1.0000 

j 

93.0000 

0.9135 

1.0000 

94.0000 

0.8292 

1.0000 

1 

95.0000 

0.7423 

1.0000 

96.0000 

1.0000 

1.0000 

97.0000 

0.0902 

0 

1 

98.0000 

0.2564 

0 

99.0000 

0 

0 

100.0000 

0.4387 

0 

0  Misclassified 

Table  [9.3]  Continued 


Non  deceptive 

Deceptive  1 

Deceptive  2 

Deceptive  3 

QQ8R9OIO.011 

QQ4Q1O83.011 

QQ7LX5Q0.021 

QQ8RAJ0C.011 

QQ8R9OIO.021 

QQ4Q1O83.021 

QQ7LX5Q0.031 

QQ8RAJ0C.021 

QQ8R9OIO.031 

QQ4Q 1083. 031 

QQ7MN2Y0.011 

QQ8RAJ0C.031 

QQ95LU1T.011 

QQ4Q3MDC.011 

QQ7MN2Y0.021 

QQ9EUKVT.011 

QQ95LU1T.021 

QQ4Q3MDC.021 

QQ7MN2Y0.031 

QQ9EUKVT.021 

QQ95LU1T.031 

QQ4Q3MDC.031 

QQ7TC5UF.011 

QQ9EUKVT.031 

QQAURNUS.021 

QQ51DE36.011 

QQ7TC5UF.021 

QQ9100X0.021 

QQAURNUS.031 

QQ51DE36.021 

QQ7TC5UF.031 

QQ9IOOXO.041 

QQAV53P6.011 

QQ51DE36.041 

QQ7TQVER.011 

QQ9SOW8L.011 

QQAV53P6.021 

QQ6RQGH6.011 

QQ7TQVER.021 

QQ9SOW8L.021 

QQAV53P6.031 

QQ6RQGH6.021 

QQ7TQVER.031 

QQ9SOW8L.031 

QQBQ4SHI.011 

QQ6RQGH6.031 

QQ7TVADC.011 

QQ9SQIK9.011 

QQBQ4SHI.021 

QQ6RQGH6.041 

QQ7TVADC.021 

QQ9SQIK9.021 

QQBQ4SHI.031 

QQ6T7 110.0 11 

QQ7TVADC.031 

QQ9SQIK9.031 

QQBSS7WT.01 1 

QQ6T71 10.021 

QQ7U2T4R.011 

QQ9W0B9F.011 

QQBSS7WT.021 

QQ6T71 10.031 

QQ7U2T4R.021 

QQ9W0B9F.031 

QQBSS7WT.031 

QQ6Z59IG.011 

QQ7U2T4R.031 

QQ9W0B9F.041 

QQ70XM60.021 

QQ6Z591G.021 

QQ7YP7QU.011 

QQ9U4FMU.011 

QQ7RH0RO.011 

QQ6Z59IG.031 

QQ7YP7QU.021 

QQ9U4FMU.021 

QQ7RH0RO.021 

QQ7PP9B9.011 

QQ7YP7QU.031 

QQ9U4FMU.031 

QQ7RH0RO.031 

QQ7PP9B9.021 

QQ7YZOJ3.011 

QQ9Y  SVF.Oll 

QQ7R51P9.011 

QQ7PP9B9.031 

QQ7YZOJ3.021 

QQ9Y_SVF.021 

QQ7R51P9.021 

QQ7PDU1X.011 

QQ7YZOJ3.031 

QQ9Y  SVF.031 

QQ7R51P9.031 

QQ7PDU1X.021 

QQ8  ODPT.Oll 

QQ9YH3QF.011 

QQ9TDSP3.011 

QQ7PDU1X.031 

QQ8  0DPT.021 

QQ9YH3QF.021 

QQ9TDSP3.021 

QQ7  PIPF.Oll 

QQ8  0DPT.031 

QQ9YH3QF.031 

QQ9TDSP3.031 

QQ7  PIPF.021 

QQ8  0DPT.041 

QQA2TT4C.011 

QQA8OWOI.011 

QQ7  PIPF.031 

QQ8  2UQ9.011 

QQA2rr4C.021 

QQA8OWOI.021 

QQ7  JT70.011 

QQ8_2UQ9.021 

QQA2TT4C.031 

QQA8OWOI.031 

QQ7_JT70.021 

QQ8  2UQ9.031 

QQA3HIRX.011 

QQBT22O6.011 

QQ7  JT70.031 

QQ8001G6.011 

QQA3HIRX.021 

QQBT22O6.021 

QQ738DYX.011 

QQ8001G6.021 

QQA3HIRX.031 

QQBT22O6.031 

QQ738DYX.021 

QQ8001G6.031 

QQA32UTF.011 

QQBO9O_9.011 

QQ738DYX.031 

QQ82OIU9.011 

QQA32UTF.021 

QQB090  9.021 

QQ75ULP9.011 

QQ82OIU9.021 

QQA32UTF.031 

QQB090  9.031 

QQ75ULP9.021 

QQ82OIU9.031 

QQA6U_IF.011 

QQBC7PP6.011 

QQ75ULP9.031 

QQ82SUTX.011 

QQA6U_IF.031 

QQBC7PP6.021 

QQ79  EYF.Oll 

QQ82SUTX.021 

QQA6U  IF.041 

QQBC7PP6.031 

QQ79_EYF.021 

QQ82SUTX.031 

QQAM4E3L.011 

QQCHCK  0.011 

QQ79  EYF.031 

QQ860ZNU.011 

QQAM4E3L.021 

QQCHCK  0.021 

QQ7BGDML.011 

QQ860ZNU.021 

QQAM4E3L.031 

QQCHCK  0.031 

QQ7BGDML.021 

QQ860ZNU.031 

QQARF2  X.011 

QQCDTKPO.Oll 

QQ7BGDML.031 

QQ89U_ZR.011 

QQARF2  X.021 

QQCDTKP0.031 

QQ7ETC8I.011 

QQ89U  ZR.021 

QQARF2  X.031 

QQCDTKP0.041 

QQ7ETC8I.021 

QQ89U_ZR.031 

QQAWA38X.011 

QQCM5Y56.011 

QQ7ETC8I.031 

QQ8ATU26.011 

QQAWA38X.021 

QQCQQT8Y.011 

QQ7JAQCS.011 

QQ8ATU26.021 

QQAWA38X.031 

QQCQQT8Y.021 

QQ7JAQCS.021 

QQ8ATU26.031 

QQAYXZGU.Oll 

QQCQQT8Y.031 

QQ7JAQCS.031 

QQ8FGMVI.011 

QQAYXZGU.021 

QQCQQT8Y.041 

QQ7LX5Q0.011 

QQ8FGMVI.021 

QQAYXZGU.031 

Table  [10]  NSA  Polygraph  files  used  in  sets  1-3. 


Note:  Each  set  consists  of  non-deceptive  files  and  one  of  the  deceptive  sets 
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Appendix  B: 
Program  Listings 


Classify  Program 


%  This  is  a  Matlab  program 
%  This  script  parses  a  matrix  of  polygraph 
%  vectors  into  training  and  testing  vectors. 

%  It  then  calls  the  classifier,  trains,  tests 

%  and  gives  results. 

c  =  2; 

%  number  of  classes 

percent_train=.75; 

%  percentage  of  inputs  \ised  for  training 

features=[l] 

%  features  to  use 

classification=l;  % 

use  fuzz>’  classifier 

kk=5; 

%  K  in  K  nearest  neighbor 

change=l; 

%  Randomize  training  and  testing  inputs 

repeat=20; 

%  Number  of  repeatitions 

ut=.5; 

%  Upper  ihreshhold  for  3  class  fuzzy  classifier 

lt=.5; 

%  Lower  threshhold  for  3  class  fazzy^  classifier 

load  set31; 

%  file  containing  feature  matrix 

%  and  vector  that  indicates  whether 

%  column  is  truthful  or  deceptive 

%class\’ect; 

%  vector  of  classes  eg.  1  =  deceptive 

%  0  =  truthful  vector 

featurematrix  =  featmat;  %  matrix  of  features 

dimension  =  size(featurematrix); 

columns  =  dimension(2);  %  the  total  number  of  columns  in  the  feature  matrix 

number  train  =  round(percent_train*columns);  %  number  of  vectors 

%  used  for  training 

ur=.5; 

%upper  threshold 

continue=l; 

while  (continue=l) 
1 

%  to  repeat  the  program 

apercent_classified=[] ;  %  clear  average  results 

acorrect=[]; 

acc=[]; 

fFresult=[]; 

ccresult=[]; 

ttestclass=[]; 

men=0; 

while(men  ~=7) 

men=menu('Select:', 'Features', Tjpe'/K', 'Random'... 
,'Repeat','%  training', 'Start', 'Defiizz'.'Exil'); 


if  (men==l) 

'enter  a  \'ector  of  the  features  you  want  tested  (eg.  [12  4])  ' 
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features  =  inputC ');  %  features  being  tested 

end 

if  (men— 2) 

classification=menu(T>pe:','Fuzzj’', 'Crisp'); 
end 

if  (men=3) 

kk  =  input('enter  the  "K"  in  K  nearest  neighbor  ') 
end 

if  (men=4) 

change=menu('Selection','Random', 'Constant'); 
end 


if  (men==5) 

repeat=input('Enter  number  of  repeatitions') 
end 

if  (men==6) 

percent_train=input('Enter  percentage  of  the  files  used  for  training,  1  for  all-1') 
end 

if  (men==8) 

ch=menu('Defii2zification',  '3 class',  Upper  thresh','Lower  thresh'); 

if  ch=l,  classification=3,  end 

ifch==2 

ut=input('enter  the  upper  threshhold');  %  lower  limit  for  class  1 
end 

ifch=3 

lt=input('enter  the  lower  threshhold');  %upper  limit  for  class  0 
end 
end 

if  (men==9)  break,end 
end 

if  men==9  break,end 

number_train  =  round(percent_lrain*columns); 

acorrect=[] ;  %  vector  for  the  average  of  correct  classification 

acc=  [] ;  %  vector  for  the  average  of  performance  index 

if  percent_train  =1  %  To  repeat  nonrandom  testing  for  all  the  files. 

repeat  =columns; 
end 

for  lrial=l :  repeat 

featurematrix  =  featmat(features,:);  %  creates  a  feature  matrix  of  the 

%  the  features  being  tested 
if  ( (change— 1)  &  (percent_train— 1) ) 

[traim’ect,  testv'ect]  =  randvect(number_train,columns); 

end; 

if  percenttrain  ==  1 

tesuect  =  trial; 
if  (trial  ==1) 

trainvect=2  ;columns; 


end 

if  (trial  ==  columns) 

lraim’ect=l  :columns-l ; 
end 

if  ( trial  -=1  &  trial  ~=columns ) 

trainvect  =  [l;trial-l ,  trial+l;columns]; 
end 
end 

tesriect 

traimect 

u  =  featurematrix(:,testvect);  %  testing  matrix 

testclass  =  class\'ect(l,testvect);  %  class  of  each  column  in  testing  matrix 


p  =  featureinatrix(;, trainvect);  %  training  matrix 
t  =  class\’ect(Ktrainvect);  %  class  of  each  column  in  training  matrix 


if  classification  =  1  %  Fuzzy  classifier 

%  m  =  input('enter  the  degree  of  fuzziness  "M"  (l<=M<=infinfity)') 
m  =  2; 

save  fdatafil  c  kk  m  p  t  u 

%  Ifknn  %This  line  invokes  the  classifier  program  in  a  dos  window 

dos('del  foutfile.mati')  %to  make  sure  that  the  program  actulally  works 

dos('fknnl') 

'Now  loading  the  result  of  the  fuzz>-  classifier* 
load  foutfile 

1 _ ' 

kk,  features 

fresult 

testclass 

tf(percent_train=  1 ) 

firesult=[ffresult  fresult] 
ttestclass=[ttestclass  testclass] ; 
end 

cr  =fresult(2,:)  >  ut  %  defuzzification  of  the  result 

correct  =  100*(l-mean(abs(testclass-cr)))  %  percentage  conect  classified 

cc  =  [  1  -testclass;  testclass] ;  %  adding  a  row  of  complements  to  c 

cc=fresult-cc; 

Terformance  Index-' 

cc  =  sqrt(mean(mean(cc  2))) 


if  classification  =  2  %  crisp  classifier 

sa\e  cdatafil  c  kk  p  t  u 

%  Icknn  %This  line  invokes  the  classifier  program  in  a  dos  window 

dosCdel  foutfile.mati')  %to  make  sure  that  the  program  actulally  works 

dosCcknnl') 

'Loading  the  Crisp  output  file' 
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load  coutfile 

I _ _ _ _ _  ' 

kk,  features 

cresult 

testclass 

if(percent_train— 1 ) 

ccresult=[ccresult  cresult] 
ttestclass=[ttestclass  testclass); 
end 

correct  =  100*(l-mean(abs(testclass-cresult)))  %  percentage  correct  classified 
cc  =  sqrt(niean(abs(testclass-cresult)))  %  performance  index 

end 

if  classification  =  3  %  Fuzz>'  classifier  but  defuzzification  into  3  classes 

%  m  =  input('enter  the  degree  of  fuzziness  "M"  (l<=M<=infinfit)’)') 
m  =  2; 

sa\’e  fdatafil  c  kk  m  p  t  u 

%  Ifknn  %This  line  invokes  the  classifier  program  in  a  dos  window 

dosC'del  foutfile.matl')  %to  make  sure  that  the  program  actulally  works 

dos('fknnl’) 

'Now  loading  the  result  of  the  fuzzj'  classifier' 
load  foutfile 

1 _ ' 

kk.  features 

fresult 

testclass 

if(percent_train==  1 ) 

ffresult=[ffresult  fresult] 
ttestclass=  [ttestclass  testclass] ; 
end 

class  l=find(fresult(2,:)  >ut); 
class0=find(fresult(2,:)  <lt); 
class3=find(fresult(2,:)  >lt  &  fresult(2,:)  <ut); 

percent_classified=100*((length(class0)+length(classl))/length(testclass)) 
fr=[fresult(:,classl)  fi-esult(:,classO)]  %  the  section  that  is  classified  into  one  of  the  two 

classes 

cr=fr(2,:)>ut 

tr=  [testclass(class  1 )  testcIass(classO)]  %  the  section  that  is  classified  into  one  of  the  two 

classes 

correct  =  100*(l-mean(abs(tr-cr)))  %  percentage  correct  classified 
cc  =  [1-tr;  tr]  ;  %  adding  a  row  of  complements  to  cc 

cc=fr-cc; 

Terformance  Index=' 
cc  =  sqrt(mean(mean(cc  2))) 
end 

apercent_classified  =  [apercent_classLfied  percent_classified] 
acorrect=[acorrect  correct] 
acc=[acc  cc] 


end 


%  for  trial 


if  classification  =3  %  3  class  fiizz)' 

apercent_classified=mean(apercent_classified) 

end 

acorrect,  niean(acorrect) 
acc,  mean(acc) 

continue=3; 

while  (continue  =  3  |  continue==4) 
continue=menu('Repeat?',  "Yes',  'no', 'Plot',  'threshold'); 
if(continue=3) 

dini=nienu('Dimension',  Two',  Three')+1; 
if(dim=2) 

pp=p(;,find(t)); 
plot(pp(l,:),pp(2,:),'r+'); 
titleCA  clustering  of  two  class  data'); 
hold  on 

pp=p(:,find(t=0)); 
plot(pp(l,;),  pp(2,:),  'gx'); 

pp=u(:,  find(testclass)); 
plot(pp(l,:),  pp(2,;),  'r+'); 
pp=u(:,find(testclass=0)); 
plot(pp(l,;),  pp(2,:),  'gx'); 

hold  off 

end  %if(dim=2) 


if(dim==3) 


pp=p(;,find(t)); 

plot3(pp(l,:),pp(2,:),  pp(3,:),  'r+'); 
title('A  clustering  of  two  class  data'); 
hold  on 

pp=p(:,find(t=0)); 
plot3(pp(l,:),  pp(2,:),  pp(3,:),  'rx'); 

pp=u(:,  find(testclass)); 
plot3(pp(l,:),  pp(2,;).  pp(3,:),  'g+'); 
pp=u( :  ,find(testclass=0)); 
plot3(pp(l,:),  pp(2,:),  pp(3,:),  'gx'); 

hold  off 

end  %if(dim— 3) 

end  %if(continue=3) 
if  (continue=4) 

ch=menu('Defuzzification',  '3class',  'Upper  thresh','Lower  thresh'); 
ifch==l,  classilication=3,  end 
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ifch=2 


ut=input('enter  the  upper  threshhold');  %  lower  limit  for  class  1 
end 

ifch=3 

lt=input('enter  the  lower  threshhold');  %upper  limit  for  class  0 
end 

if  classification=l 

cr  =ffresult(2,:)  >  ut  %  defuzzification  of  the  result 

correct  =  100'''(l-mean(abs(ttestclass-cr)))  %  percentage  correct  classified 

cc  =  [  1  -ttestclass;  ttestclass] ;  %  adding  a  row  of  complements  to  c 

cc=ffresult-cc; 

'Performance  Index- 
cc  =  sqrt(mean(mean(cc  2))) 
end 

if  classification— 2 

correct  =  100*(l-mean(abs(ttestclass-ccresult)))  %  percentage  correct  classified 
cc  =  sqrt(mean(abs(ttestclass-ccresult)))  %  performance  index 
end 

if  classification==3 

class  l=find(fffesult(2,:)  >ut); 
class0=find(ffresult(2,:)  <lt); 
class3=find(ffresult(2,:)  >lt  &  fffesult(2,:)  <ut); 

fr=[ffresult(;,classl)  fEiresult(:,classO)]  %  the  section  that  is  classified  into  one  of 

the  nvo  classes 

cr=fr(2,:)>ut 

tr=[ttestclass(classl)  ttestclass(classO)]  %  the  section  that  is  classified  into  one  of 

the  two  classes 

percent_classified=100*((length(class0)+length(classl))/length(ttestclass)) 
correct  =  100*(l-mean(abs(tr-cr)))  %  percentage  correct  classified 
cc  =  [l-tr;  tr];  %  adding  a  row  of  complements  to  cc 
cc=ff-cc; 

'Performance  Index-' 
cc  =  sqrt(mean(mean(cc  2))) 
end 
end 

end  %  while  continue  =  3  1 4 


end 


%  while  continue 


This  program  implements  a  K-nearest  neighbor  classifier, 
created  by:  Shahab  Layeghi 

created:  8/4/93 
last  modified:  9/17/93 


/♦  The  main  program  opens  a  matlab  data  file,  reads  the  training  matrix, 
classifies  each  entr>'  in  the  testing  matrix,  and  wxites  the  result  in  an 
output  file.  The  file  that  this  program  gets  the  information  from  should  be 
called  "cdatafil.mat".  As  the  name  implies  it  is  in  matlab  file  format. 

The  data  in  this  file  should  have  the  following  order; 

1.  A  single  variable  'C  which  is  the  number  of  classes. 

2.  A  single  variable  'K'  which  is  the  parameter  TC'  in  K-NN  Algorithm. 

3.  A  trainig  matrix  T'  w  hich  contains  a  set  of  feature  vectors.  Each  vector 
is  in  a  column  of  the  matrix. 

4.  A  classes  vector  T  which  contains  the  classes  of  the  training  set 

5.  An  input  matrix  U  which  contains  a  set  of  unclassified  feature  vectors. 

The  main  program  uses  the  CrispKNN  routine  to  classify  each  one  of  the  input 
vectors  and  saves  the  results  (the  classes  that  these  inputs  belong  to)  in  a 
file  called  coutfile.mat.  This  file  is  in  Matlab  format.  This  file  contains 
a  vector  of  the  classes  called: 

'cresult' 


This  program  can  be  called  from  dos,  or  within  Matlab  by  using  dos  escpae 
character '!'.  An  example  Matlab  script  file  that  shows  how  this  program  can 
be  used  is  included  in  the  file  "ckimtest.m". 

*1 

#include  <stdio.h> 

#include  <stdlib.h> 

#include  <time.h> 

#include  <math.h> 

#include  <conio.h> 

#define  INPUTFILE  "cdatafil.mat" 

#define  OUTPUTFILE  "coutfile.mat" 

//  Function  Prototjpes - 

int  CrispKNN(double  *Input,  double  *Samples,  double  ♦Lables); 
double  FindDistance(double  *vecl,  double  *\ec2); 
double  Maxd(double  *vec,  int  *index,  int  Length); 
int  FindMax(int  *vector,  int  *count,  int  Length,  int  Max); 
int  loadmat(Fn-E  *  fp,int  *t>pe,  char  *pname,  int  ♦mrows,  int  *ncols, 
int  *imagf,  double  **preal,  double  **pimag); 
void  savemat(FILE  *fp,  int  tjpe,  char  *pname,  int  mrows,  int  ncols, 
int  imagf,  double  ♦pr^,  double  *pimag); 


//  Global  variables,  these  variables  will  be  set  by  reading  matlab  file - 

int  classes;  /*  the  number  of  classes  */ 

int  features;  /*  Number  of  features  in  a  class  */ 


intKK; 
int  SampleSize; 
int  TestSize; 


/*  K  in  K-nearest  neighbors  */ 

/*  Number  of  Labled  Samples  */ 


//■ 


void  mainO 

{ 


double  *Lables; 
double  *KP; 
double  ♦input; 
int  ij; 

FILE  *fp; 
char  name[20]; 
int  npe,  imagf; 

double  *Samples,  ♦isamples;  //  isamples  is  for  imaginary  part  of  the  matrix  that  is  not  used  in 

here. 

double  *Testdata; 
double  *result; 
fp=fopen(INPUTFE-E,"ib"); 
if(!^)  { 

printfC'cannot  open  the  file"); 
exit(-l); 

} 

//  read  classes  from  the  file 

loadmat(fp,  &t)pe,  name,  &i,  &j,  &imagf,  &KP,  &isamples); 

if(i!=l||j!=l){ 

printfC'error:  You  should  include  classes  at  the  beginning  of  the  file\n"); 
exit(-l); 

} 

classes=*KP; 

//  read  KK  from  the  file 

loadmat(fp.  &type,  name,  &i,  &j,  &imagf,  &KP,  &isamples); 
if(i!=l||j!=l){ 

printfC'error:  You  should  include  K  at  the  beginning  of  the  file\n"); 
exit(-l); 

} 

KK=*KP; 

//  read  the  matrix  from  the  datafile. 

loadmat(Ip,  &t)pe,  name,  Afeatures,  ASampleSize,  Aimagf,  ASamples,  Aisamples); 

//  reading  lables  from  data  file 

loadmat(fp,  Atype,  name,  Ai,  Aj,  Aimagf,  ALables,  Aisamples); 
if(i!=l  II  j!=SampleSize)  { 

printfC'error:  Number  of  labels  is  different  from  the  number  of  samples\n"); 
exit(-l); 

} 
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//  read  data  to  be  classified  from  the  file 

loadmat(fp,  &t>pe,  name,  &i,  &TestSize,  &imagf,  &Testdata,  &isamples); 
if(i  !=  features)  { 

printfC'enor:  Training  and  testing  matrices  should  have  the  same  size"); 
exit(-l); 

} 

//  Allocate  space  for  result  vector 

result  =  (double  *)  malloc(TestSize''‘sizeof(double)); 
if(!  result)  { 

printf("Error;  cannot  allocate  memor>'  for  the  result  vector"); 
exit(-l); 

} 

for(i=0;  i<TestSize;  1++)  {  //  for  each  input 

input=Testdata+i*features; 
result[i]=CrispKNN(input,  Samples,  Lables); 

//  printfC'class:  %lf\n",  result[i]); 

} 

fclose(fp); 

//  printf(''\n  End  of  classification.  Now  writing  the  result  in  the  file"); 

fp=fopen(OUTPUTFILE,  "wb"); 
if(!fp)  { 

printfC'Error;  Cannot  write  the  file"); 
getchO; 

} 

savemat(fp,  0,  "cresult",  1,  TestSize,  0,  result,  result); 
fclose(fp); 

} 

/♦ - */ 

int  CrispKNN(double  *lnput,  double  *Samples,  double  •Lables) 

{ 

int  i  j  ; 
int  nj,  k,  nk; 
double  *distance; 
int  *index; 
double  x,y; 

distance  =  (double  *)  malloc(KK*sizeof(double)); 
if(!  distance)  { 

printfC'Error:  Not  enough  memory  for  distance  vector"); 
exit(-l); 

} 

index  =  (int  '^)  malloc(KK*sizeof(int)); 
if(!  index)  { 

printf("Error:  Not  enough  memory  for  index  vector"); 
exit(-l); 

} 
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for(i=0;  i<KK;  i-H-)  {  //  This  loop  initializes  K  nearest  neighbors  to  the  first  K  Samples 
index[i]=Lables[i]+l; 

distance[i]=Fin(iDistance(Input,  &Samples[i*features]); 

} 

for(i=KK;  i<SampleSize;  i++)  {  //  This  is  the  loop  that  finds  the  K  nearest  Neighbors 
x=Maxd(distance,  &j,  KK); 
j  -FindDistance(Input,  &Samples[i*features]); 

1%  <  x)  {  //  This  sample  is  closest  to  the  input  than  the  ferthest  K  Neighbors 
distance[j]=y; 
index[j]=Lables[i]+ 1 ; 

} 

} 

j=FindMa\(index,  &nj,  KK,  classes);  //  Finds  the  class  of  maximum  occurance 

/*  In  this  section  it  is  checked  to  see  if  there  is  a  tie.  That  is  if 
there  are  two  or  more  classes  with  the  same  number  of  occureances.  If 
there  is  a  tie  for  two  classes,  the  class  with  the  minimum  sum  of 
distances  is  selected.  No  action  is  taken  for  a  tie  of  more  than  two 
classes.  */ 


for  (i=0;  i<KK;  i-H-) 

if(index[i]==j)  index[i]=0; 
k=FindMax(index,  &nk,  KK,  classes); 
if(nk=nj)  {  //If  there  is  a  tie. 

x=0; 

for(i=0;  i<KK;  i-H-)  { 

if(index[i]==0) 

x-t-=distance[i]; 

} 

for(i=0;  i<KK;  i-H-)  { 
if(index[i]==k) 

yH-=distance[i]; 

} 

if(y<x) 

less  than  that  of  class  k 


} 


j=k; 


fTee(distance); 
fi’ee(index); 
return  j-1; 


//If  sum  of  the  distances  to  class  j  is 


} 

/* - - - - - */ 

/*  This  function  returns  the  Euclidian  distance  between  two  vectors  */ 

double  FindDistance(double  *vecl,  double  •vec2) 

{  . 


int  k; 

double  distance; 
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distance  =  0; 

for(k=0;  k<features;  k-H-)  { 
distance  + 

//  distance  + 

} 

return  distance; 

} 


/*  This  function  finds  the  biggest  element  of  an  array.  It  returns  that 
value  and  also  returns  the  index  to  that  element  in  index. 

*/ 

double  Maxd(double  *vec,  int  ‘index,  int  Length) 

{ 

int  ij=0; 
j=0; 

for(i=l;  i<Length;  i-H-) 

if(vec[i]>vec|j])  j=i; 

*index=j; 

retum(vec[j]); 

} 

/♦ - *! 

/*  This  function  finds  a  number  that  is  most  often  repeated  in  an  array  of 
integer  values,  and  returns  that  number.  Length  of  array  shoud  be  less  than 
100.  It  is  supposed  that  number  is  an  integer  greater  than  zero, 
vector  is  a  pointer  to  the  array,  count  is  the  number  of  times  that  the 
number  is  repeated.  Length  is  the  length  of  the  vector. 

*/ 

int  FindMa.\(int  ‘vector,  int  ‘count,  int  Length,  int  Max) 

{ 


=(vecl  [k]-vec2[k])‘(vecl  [k]-vec2[k]), 
=  po\v(vecl[k]-vec2[k] ,  2); 


int  i,  j,  m; 
inttllOlJ; 

if(Ma\>100)  Max=100; 
for(i=0;  i<Max+l;  i-H-) 
t[i]=0; 

for(i=0;  i<Lcngth;  i-H) 
tIvector[i)]-H; 

m=t[l]; 

j=i; 

for(i=l;  i<Max-i-l;  i-H)  { 
if(t[i]>m)  { 

m=t[i]; 

j=i; 

} 

} 

*count=m; 
return  (j); } 
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/*  This  program  implements  a  fuzzy  version  of  K-nearest  neighbor  dassifler. 
created  by:  Shahab  Layeghi 

created:  9/1/93 
last  modified:  9/3/93 

*/ 

/*  The  main  program  opens  a  matlab  data  file,  reads  the  training  matrix, 
classifies  each  entrj-  in  the  testing  matrix,  and  writes  the  result  in  an 
output  file.  The  file  that  this  program  gets  the  information  from  should  be 
called  "fdatafile.mat".  As  the  name  implies  it  is  in  matlab  file  format. 

The  data  in  this  file  should  have  the  following  order: 

1.  A  single  variable  'C  which  is  the  number  of  classes. 

2.  A  single  variable  'K'  which  is  the  parameter  'K'  in  K-NN  Algorithm. 

3.  A  single  variable  'M'  which  is  the  coefficient  in  fuzzj’  algorithm. 

4.  A  trainig  matrix  'P  which  contains  a  set  of  feature  vectors.  Each  vector 
is  in  a  column  of  the  matrix. 

5.  A  class  membership  matrix  T  which  contains  the  membership  values  of  the 
training  set  vectors  to  the  classes. 

6.  An  input  matrix  'U'  w  hich  contains  a  set  of  unclassified  feature  vectors. 

The  main  program  uses  the  Fuzzj  KNN  routine  to  classi^'  each  one  of  the  input 
vectors  and  saves  the  results  (the  classes  that  these  inputs  belong  to)  in  a 
file  called  "foutfile.mat".  This  file  is  in  Matlab  format.  This  file  contains 
a  single  variable  called  fresult.  It  is  a  vector  of  the  classes. 

This  program  can  be  called  from  dos,  or  within  Matlab  by  using  dos  escpae 
character  ’!'.  An  example  Matlab  script  file  that  shows  how  this  program  can 
be  used  is  included  in  the  file  "flumtest.m". 

*/ 

#include  <stdio.h> 

^include  <stdlib.h> 

#include  <time.h> 

#include  <math.h> 

#include  <conio.h> 

#define  INPUTFILE  "fdatafil.mat" 

#define  OUTPUTFELE  "foutfile.mat" 

//  Function  Prototypes - 

void  FuzzyKNN(double  *Input,  double  *Samples,  double  *Lables,  double  *Result); 
double  FindDislance(double  *vecl,  double  *vec2); 
double  Ma\d(double  *vec,  int  ♦index,  int  Length); 
int  FindMax(int  *vector,  int  *count,  int  Length,  int  Max); 
int  loadmat(FE,E  *  fp,int  ♦type,  char  ♦pname,  int  ♦mrows,  int  ♦ncols, 
int  ♦imagf,  double  ♦♦preal,  double  ♦♦pimag); 
void  sa\  emat(FILE  ♦^,  int  type,  char  ♦pname,  int  mrows,  int  ncols. 
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int  imagf,  double  *preal,  double  ♦pimag); 


//  Global  variables,  these  variables  will  be  set  by  reading  matlab  file - 

int  Classes;  I*  the  number  of  classes  */ 

int  features;  /*  Number  of  features  in  a  class  */ 

int  KK  ;  /*  K  in  K-nearest  neighbors  */ 

int  SampleSize;  /*  Number  of  Labled  Samples  */ 

int  TestSize; 

double  M;  /*  Coefficient  in  fuzzy 

algorithm 

//. - 


/• - */ 

void  main() 

{ 

double  *Lables; 
double  *KP; 
double  ♦input; 
int  ij; 

FILE  *fp; 
char  name[20]; 
int  tjpe,  imagf; 

double  *  Samples,  *isamples;  //  isamples  is  for  imaginary  part  of  the  matrix  that  is  not  used  in 

here, 

double  *Tesldata; 

double  ♦result;  //  pointer  to  the  result  matrix 

double  ♦iresult;  //  result  vector  of  classification  of  a  single  vector 

fp=fopen(INPUTFILE,"rb’'); 
if(!ip)  { 

printfC'cannot  open  the  file"); 
exit(-l); 

} 

//  read  classes  from  the  file 

loadmat(fp,  &t)pe,  name,  &i,  &j,  &imagf,  &KP,  &isamples); 
if(i!=l||j!=l){ 

printf("error:  You  should  include  classes  at  the  begirming  of  the  file\n"); 
exit(-l); 

) 

Classes=^KP; 

//  read  KK  fi'om  the  file 

loadmat(fp,  &t>pe,  name,  &i,  &j,  &imagf,  &KP,  &isamples); 

if(i!=l||j!=l){ 

printfC'error:  You  should  include  K  at  the  beginning  of  the  file\n"); 
exit(-l); 

} 

KK=^KP; 


48 


//  read  M  from  the  file 

loadmat(fp,  &t)pe,  name,  &i,  &j,  &imagf,  &KP,  &isamples); 
if(i!=l||j!=l){ 

printfC'error:  You  should  include  M  as  the  thrid  parameter\n"); 
exit(-l); 

} 

M=*KP; 

//  read  the  matrix  from  the  datafile. 

loadmat(fp,  &tjpe,  name,  &features,  &SampleSize,  &imagf,  &Samples,  &isamples); 
//  reading  lables  from  data  file 

loadmat(fp,  &t>pe,  name,  &i,  &j,  &imagf,  &Lables,  &isamples); 
if(i!=l  II  j!=SampleSize)  { 

printfC'error:  Number  of  labels  is  different  from  the  number  of  samples\n"); 
exit(-l); 

} 

//  read  data  to  be  classified  from  the  file 

loadmat(fp,  &t>pe,  name,  &i,  &TestSize,  &imagf,  &Testdata,  &isamples); 
if(i  !=  features)  { 

printfC'error:  Training  and  testing  matrices  should  have  the  same  size"); 
exit(-l); 

} 

//  Allocate  space  for  result  vector 

result  =  (double  *)  malloc(TestSize'*Classes'*sizeof(double)); 
if(!  result)  { 

printf("Error:  cannot  allocate  memoij’  for  the  result  Matrix"); 
exit(-l); 

} 

for(j=0;  j<TestSize;  j++)  {  //  for  each  input 

input=Testdata+j'"features; 

FuzzjKNN(input,  Samples,  Lables,  iresult); 

//  printf^C'\n  Memberships:"); 

for(i=0;  i<Classes;  1++)  { 

result[j*Classes+i]=iresult[i]; 
printf("  %lf  ",  iresult[i]); 

} 

} 

fclose(^); 

H  printf("\n  End  of  classification.  Now  writing  the  result  in  the  file"); 

fp=fopen(OUTPUTFILE,  "wh"); 
if(!fp)  { 

printf("Error:  Cannot  write  the  file"); 
getchO; 

} 

sa\emat(fp,  0,  "fresult".  Classes,  TestSize,  0,  result,  result); 
fclose(fp); 
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} 


/♦ - •/ 

/♦  This  is  a  fuzzj'  K  Nearest  neighbor  classifier  routine.  Input  is  the 
vector  to  be  classified,  Samples  is  the  matrix  of  classified  samples, 

Lables  is  the  vector  of  the  classes  that  these  samples  belong  to. 

Result  is  the  vector  of  membership  values  of  Input  to  each  class. 

*/ 

void  Fuzz)KNN(double  *Input,  double  ^Samples,  double  *Lables,  double  ♦Result) 

{ 

int  ij,n ; 
int  nj,  k,  nk; 
double  *distance; 
int  ♦index; 
double  x,y; 

double  ♦membership;  //  pointer  to  membership  matrix 

double  nsum,  dsum,  temp; 

/♦  This  section  builds  a  fuzz>’  membership  matrix  from  the  lables. 

Membership  of  each  sample  to  the  class  that  it  belongs  to  is  assigned 
to  1,  and  the  membership  of  it  to  other  classes  is  assigned  to  0  ♦/ 

membership  =  (double  ♦)  malloc(SampleSize^Classes^sizeof(double)); 
if(!  membership)  { 

printfC'Enor:  Not  enough  memoiy  for  membership  matrix"); 
exit(-l); 

} 

for(i=0;  i<SampleSize^Classes;  i++) 

♦(membership+i)=0;  //  Initializing  matrix  to  zero 

for(j=0;  j<SampleSize; )++)  { 
i=^(Lables+j); 

♦(membership+i^SampleSize+j)=l ; 

} 

distance  =  (double  ♦)  malloc(KK^sizeof(double));  //  allocate  space  for  the  vector 
if(!  distance)  { 

printfC'Error:  Not  enough  memor>’  for  distance  vector"); 
exit(-l); 

} 

index  =  (int  ♦)  malloc(KK^sizeof(int)); 
if(!  index)  { 

printfC'Error;  Not  enough  memoi)’  for  index  vertor"); 
exit(-l); 

} 

for(i=0;  i<KK;  i+-f)  {  //  This  loop  initializes  K  nearest  neighbors  to  the  first  K  Samples 
index[i]=i; 

distance[i]=FindDistance(Input,  &Samples[i^features]); 

} 

for(i=KK;  i<SampleSize;  i-H-)  {  //  This  is  the  loop  that  finds  the  K  nearest  Neighbors 
x=Maxd(distance,  &j,  KK); 
j— FindDistance(Input,  &Samples[i^features]); 

if(j’<x){  //  This  sample  is  closest  to  the  input  than  the  farthest  K  Neighbors 
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distance[j]=y; 

indexD]=i; 

> 

} 

for(j=0;  j<Classes;  j-H-)  { 
nsum=dsuin=0; 
for(n=0;  n<KK;  n-H-)  { 
i=index[n]; 

temp=FindDistance(Input,  &Samples[i*features]); 
if(teinp  <  le-10)  { 

zero 

Result|j]=membership[j*SampleSize+i] ; 
break; 

} 

if(M  =  2) 

temp=l/temp; 
else  if(M  !=  1) 

temp=povv(  1/temp,  1/(M-1)); 
else 

temp=0; 

nsum  +=  membership[j*SampleSize+i]*temp; 
dsum  +=  temp; 

} 

if(dsum  !=0) 

ResultU]=nsum  /  dsum; 

} 

free(membership); 

free(distance); 

free/index); 

} 


/* - */ 

/♦  This  function  returns  the  Euclidian  distance  between  two  vectors  *! 

double  FindDistance(double  ♦vecl,  double  ♦vec2) 

{ 

int  k; 

double  distance; 
distance  =  0; 

for(k=0;  k<features;  k-H-)  { 

distance  +=  (vecl[k]-vec2[k])*(vecl[k]-vec2[k]); 
//  distance  +=  pow(vecl [k]-vec2[k] ,  2); 

} 

return  distance; 


/* - */ 

/*  This  function  finds  the  biggest  element  of  an  array.  It  returns  that 
value  and  also  returns  the  index  to  that  element  in  index. 

*/ 


//If  distance 


double  Maxd(double  *vec,  int  *index,  int  Length) 

{ 

int  ij=0; 
j=0; 

for(i=l;  i<Length;  1++) 

if(vec[i]>vec|j])  j=i; 

♦index=j; 

return(veclj]); 

} 

/♦ - •/ 

I*  This  function  finds  a  number  that  is  most  often  repeated  in  an  array  of 
integer  values,  and  returns  that  number.  Length  of  array  shoud  be  less  than 
100.  It  is  supposed  that  number  is  an  integer  greater  than  zero, 
vector  is  a  pointer  to  the  array,  count  is  the  number  of  times  that  the 
number  is  repeated.  Length  is  the  length  of  the  vector. 

*/ 

int  FindMax(int  ♦vector,  int  ♦count,  int  Length,  int  Max) 

{ 


int  i,  j,  m; 
intt[101]; 

if(Max>100)  Max=100; 
for(i=0;  i<Max+l;  i-H-) 
t[i]=0; 

for(i=0;  i<Length;  i-H-) 
t[vector[i]]-H-; 

m=t[l]; 

j=i; 

for(i=l;  i<Max-t-l;  i-H-)  { 
if(t[i]>m)  { 

m=t[i]; 

j=i; 

} 

} 

♦count=m; 
return  (j); 


} 
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I.  Project  Summary 

Polygraph  testing  has  been  used  as  a  technique  for  measuring  deception  throughout  the  twentieth 
century.  Throughout  most  of  this  time  period  the  task  of  interpreting  the  data  has  rested  solely 
on  the  trained  examiner.  Recently,  automated  computer  evaluation  of  the  polygraph  using  statis¬ 
tically  derived  discrimination  functions  has  begun  in  an  effort  to  aid  the  polygraph  examiner. 

The  purpose  of  this  proposed  study  is  to  continue  the  work  begun  under  ONR  Grant  NOOO 14-93- 
1-0570  to  investigate  the  use  of  fuzzy  set  classification  to  perform  the  data  analysis.  In  that  previ¬ 
ous  study  it  was  shown  that  fuzzy  membership  functions  can  accurately  classify  the  MGQT  poly¬ 
graph  data  at  greater  than  90%  accuracy  levels.  This  study  will  focus  on  optimizing  the  fuzzy 
classifier  further,  test  the  classifier  on  ^ne  Comparison  Data  as  well  as  MGQT,  and  adapt  the 
algorithm  for  use  in  a  real-time  testing  scenario.  At  the  completion  of  this  project,  a  software 
program  will  be  delivered  that  will  perform  classification  of  the  polygraph  data  on  an  80486 
based  personal  computer. 
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n.  Project  Description 

A.  Objectives  of  Proposed  Project 

The  objectives  of  the  proposed  project  are  to: 

(1)  study  the  relationship  between  the  fuzzy  classifier  and  the  success  of  classification; 

(2)  test  the  optimized  algorithm  on  both  MGQT  and  zone-comparison  data; 

(3)  and  investigate  the  algorithm  in  a  real-time  testing  scenario. 


B.  Introduction 
1.  The  Polygraph 

The  ability  to  directly  measure  the  signals,  both  mechanical  and  electrical,  emanating  from  the 
living  human  body  has  been  around  for  hundreds  of  years.  Ever  since  the  beginning  of  these  ob¬ 
servations  the  interpretation  of  biological  data  has  been  used  to  understand  the  physiology, 
pathology,  neurology,  and  psychology  of  the  living  human.  In  the  late  1800’s,  study  began  on  in¬ 
terpreting  biological  data  in  an  effort  to  better  understand  one  particular  aspect  of  human  cogni¬ 
tive  psychology  -  deceit  (Lombroso,  1895).  Specifically,  respiration  rate,  heart  rate/blood  pres¬ 
sure,  and  galvanic  skin  response,  measured  by  a  device  known  as  the  polygraph,  were  used  to  de¬ 
termine  whether  a  person  was  telling  the  truth  or  lying.  Over  the  past  90  years  this  device  has 
been  used  with  varying  degrees  of  success.  Because  of  its  recent  and  abundant  use  in  criminal  in¬ 
vestigations  and  employee  screening,  the  accuracy  of  the  test  has  become  increasingly  critical. 

Tw'o  of  the  leading  causes  of  failure  of  the  polygraph  test  to  accurately  and  definitively  assess  a 
subject’s  veracity  are  the  individual  administrator’s  variability  in  interpreting  the  polygraph  data 
and  the  complexity  of  the  interpretation  protocols  (Office  of  Technology  Assessment,  1983).  To 
overcome  this  shortcoming  of  the  polygraph  test,  recent  work  (see  Olsen,  1991)  has  focused  on 
the  use  of  computers  for  interpretation  of  the  biological  data. 

One  technique  used  for  computer  analysis  of  the  polygraph  involves  two  steps  (Olsen,  1991  and 
Kircher,  1988,  for  example).  First  the  data  is  described  by  approximately  20  parameters  (descrip¬ 
tors)  which  have  been  determined  to  be  important  in  the  evaluation  of  the  polygraph.  Second, 
this  data  is  evaluated  using  statistical  discriminant  analysis  to  "construct  an  optimal  linear  combi¬ 
nation  of  physiological  measures  for  diagnosing  truth  and  deception"  (Kircher,  1988).  The  results 
of  Kircher’s  work  showed  that  by  using  a  derived  discriminating  function  and  arbitrary  threshold 
level,  the  computer  could  equal  (and  actually  exceed)  the  performance  of  an  experienced  poly¬ 
graph  examiner.  This  discriminating  function  reinforced  the  observation  that  the  Galvanic  Skin 
Response  is  the  most  important  indicator  of  a  subjects  truthfulness. 
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Two  questions  that  arise  from  this  type  of  discriminant  analysis  are: 

1)  Is  this  discriminant  function  and  threshold  level  optimal  for  all  subjects? 

2)  Are  there  other  possible  descriptors  of  the  polygraph  which  would  yield  even  more  informa¬ 
tion  about  the  subject’s  veracity? 

The  use  of  fuzzy  set  theory  may  shed  light  on  these  questions,  and  in  so  doing  may  produce  an 
even  more  accurate  polygraph  analysis. 

2.  Fuzzy  Logic 

Signals  can  be  generally  classified  into  three  categories;  deterministic,  probabilistic,  and  possi- 
bilistic  (fuzzy  events).  In  the  case  of  biological  data  the  patterns  are  probabilistic  or  possibilistic 
because  they  generally  contain  a  large  random  component.  As  mentioned  previously,  computer 
scoring  of  the  polygraph  relies  on  probabilistic  discrimination  functions  and  an  arbitrary  thresh¬ 
old  to  classify  the  data.  Fuzzy  set  theory,  however,  defines  the  concept  of  a  possibilistic  distribu¬ 
tion  as  a  fuzzy  restriction  which  acts  as  an  elastic  constraint  on  the  values  that  may  be  assigned 
to  a  variable  (Zadeh,  1977).  "A  fuzzy  variable  is  associated  with  a  possibility  distribution  in 
much  the  same  way  as  a  random  variable  is  associated  with  a  probability  distribution."  (Zadeh, 
1977) 

The  key  to  fuzzy  logic  is  that  classes  of  objects  exist  with  a  continuum  of  grades  of  memberships 
(Zadeh,  1965)  so  that,  unlike  probabilistic  discrimination  functions,  no  arbitrary  threshold  is 
needed.  Rather,  classification  is  made  according  to  the  degree  of  membership  in  a  given  class.  In 
addition,  the  membership  function  itself  can  be  automatically  adapted  for  a  given  training  set 
composed  of  data  and  its  corresponding  class.  This  is  because  the  theory  of  possibility,  as  com¬ 
pared  to  the  theory  of  probability,  relates  to  the  perception  of  degrees  of  evidence  instead  of  de¬ 
grees  of  likelihood  (Zadeh,  1977). 

Figure  #1  shows  the  components  of  a  fuzzy  set  classification  system  (Martin,  1982).  One  can 
see  that  the  learning  mechanism  is  only  active  in  the  adaptation  of  the  partition  (membership 
function).  This  system  operates  on  a  set  of  data  xn(t)  which  are  extracted  from  a  training  set,  that 
is; 

x(t)  =  [xi(t),  X2(t), ...,  xn(t)]  with  0  <  xi(t)  <  1 

The  process  of  extracting  the  data  set  from  the  training  set  is  completed  in  the  "Signal  Process¬ 
ing"  step.  This  step  is  also  known  as  the  data  parser. 
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Figure  #1;  Fuzzy  Logic  Classification  System 
For  each  class  Ck,  there  corresponds  a  vector  of  descriptors  characterizing  the  class. 

.  Pk  *  [  Pl.k<  P2.k.  •••>  Pn.k  ]  0  £  p|,k  Si 

The  estiinated  parameter  set,  pk,  defines  a  set  of  membership  functions,  pk.  The  degree  of  mem¬ 
bership  of  a  set  X  <  U  to  a  class  Ck  is  given  by  the  function 

Pk  •  n  Pi'.k  ^  ^  ‘  ^  * 

The  form  of  the  above  function  is  not  unique.  The  function  has  a  maximum  value  when  xi  is 
equal  to  pgc  as  shown  in  Figure  #2.  For  example,  for  input  data  xi «  0.7,  the  grades  of  member¬ 
ship  functions  uijc  are  0.4, 0.3, 0.55,  and  051  corresponding  to  pa  equal  to  0.2, 0.3, 0.7,  and  0.8, 
respectively.  Thus  the  mflYimiim  value  is  0.55  when  pais  0.7,  which  is  equal  to  XL  Note  the 
maximum  value  is  not  necessarily  1.  Each  data  xi  has  equal  contribution  to  the  membexsh^ 
function  pk.  The  possibility  that  the  value  xi  is  an  object  in  class  Ck  depends  on  the  degree  of 
membership  of  xi  to  Ck.  The  decision  for  the  assignment  of  elements  to  classes  depends  on  Ae 
values  of  the  maximum  membership.  That  is: 

Ck  =  ntax  Pi(x)  where  0<i,  k<n 

t 

If  the  decision  disagrees  wiA  Ae  training  set,  Ac  parameter  pk(t)  will  be  modified  to  pk(t+l): 

Pk(t+1)  -  PkW  +  (  xd)  •  Pk(t)  ) 

Nk(t)+1 
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Figure  #2:  Example  ..e  leaching 

For  the  polygraph  there  are  p  rafculTte  ta  ow'n  memSmhip  functions.  Also' 

the  statistical  approach,  the  fuzzy  c  a  ^  however,  many  membership  functions  for 

Z  has  been  one  of  the  focusses  of  the  project. 

C.  Proposed  Research 
1.  Hypothesis 

Worhbytheaumorandhisrad^«~ 

have  used  parametenzauon  techmqu  P  ^l^avior  of  the  signal  and  is  prone  to  being 

tion  requires  a  detailed  "Gestalt  know  ^ge  I,  optimality  of  these  pa- 

inaccurate  over  a  wide  range  o  J  •  joblems  the  author  has  used  only  the  EEG  time 
rameters  is  obtained.  In  order  to  ^  set.  It  is  theorized  that  this  approach 

definition  of  successful  classificaoon.) 
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It  is  also  believed  that  the  transformation  of  the  polygraph  data  into  the  frequency  domain  will 
allow  the  fuzzy  set  to  detect  such  known  parameters  as  baseline  shift  and  amplitude  modulation 
in  the  respiration  rate  (suppression  and  staircase  suppression  can  both  be  classified  as  amplitude 
modulation).  For  the  same  reason  it  is  conjectured  that  the  auto-  and  crosscorrelation  of  the  three 
different  data  types  will  present  any  coirelated  behavior  of  the  biological  signals  to  the  fuzzy 
classifier. 

In  fact,  the  work  proformed  under  the  previous  polygraph  proposal  has  begun  to  confirm  these 
hypotheses  (see  Appendix  A,  "Progress  Report").  Over  600  features  in  time  and  frequency,  both 
individual  and  cross-correlated,  were  examined.  With  little  optimization  of  the  fuzzy  classifica¬ 
tion  algorithm,  classification  levels  of  greater  than  ninety  percent  were  achieved. 

2.  Methods 

While  time-shortened  by  almost  half  a  year,  three  out  of  the  five  previous  project  goals  were 
achieved.  (Once  again,  please  see  Appendix  A,  "Progress  Report"  for  a  complete  report.)  First,  a 
program  for  parsing  the  MGQT  data  has  been  developed.  This  program  extracts  all  waveforms 
from  the  the  case  files  and  parses  the  data  taking  into  account  the  fact  that  some  MGQT  ques¬ 
tions  may  be  asked  out  of  order,  may  not  always  be  repeated  three  times,  and  may  not  be  asked  at 
all. 

Second,  a  fuzzy  classifier  has  been  created,  based  on  the  fuzzy  k-nearest  neighbor  algorithm. 

This  algorithm  returns  a  continuous  truth  versus  deception  value  between  zero  and  one.  It  was 
trained  on  25  truthful  and  25  deceptive  files,  and  achieved  91%  accuracy  on  another  set  of  25 
truthful  and  25  deceptive  files. 

Lastly,  relationships  between  feature  sets  and  classifier  success  were  determined.  Of  great  im¬ 
portance  was  that  the  set  of  four  best  features  had  a  feature  from  each  one  of  the  physiological 
channels. 

In  the  proposed  study,  three  goals  will  be  achieved.  First,  a  study  of  the  relationship  between 
fuzzy  classifiers  and  success  of  classification  will  be  performed.  There  are  several  forms  of 
fuzzy  classifiers,  using  both  unsupervised  and  supervised  learning.  The  first  phase  of  this  project 
will  focus  on  which  algorithm,  or  combination  of  algorithms  will  be  optimal.  Specifically,  a  su¬ 
pervised  adaptive  fuzzy  mebership  function  algorithm  will  be  compared  with  the  fuzzy  k-nearest 
neighbor  algporithm. 

Secondly,  a  comparison  between  the  performance  of  this  algorithm  and  algorithm  used  elsewhere 
is  important  in  understanding  its  benefits.  Therefore,  the  second  phase  of  this  project  will  focus 
on  comparing  our  results  on  the  MGQT  with  our  results  on  the  more  common  zone  comparison 
test. 

Finally,  the  ultimate  goal  of  this  project  is  to  create  a  program  that  will  assist  the  polygraph  ex¬ 
aminer  in  evaluating  a  subject  while  the  examination  is  in  progress.  To  do  this,  the  algorithm 
will  output  a  fracional  number  from  0  to  1  (0  meaning  deceptive  and  1  meaning  truthful)  after 
each  examination  question.  If  the  question  is  a  control  or  irrelevent  question,  the  examiner  will 
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tell  the  program  this,  and  it  will  leam,  in  real-time  accordingly.  Because  the  examiner  is  given  a 
continuous  measure  of  truth  or  deception,  he/she  can  now  focus  on  questions  that  are  yielding  in¬ 
determinate  results. 


D.  Deliverable 

At  completion  of  this  project,  a  highly  optimized,  real-time,  automatic  polygraph  algorithm  will 
be  delivered  on  IBM  PC  compatable  discs.  This  program  will  run  on  a  80486  PC  or  faster  with 
at  least  4MB  of  memory.  In  addition,  as  was  done  on  the  previous  projea,  a  complete  report  will 
be  written  documenting  all  results  and  operations  of  the  algorithm. 
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V.  Budget 

One  Year  Budget  Summary 
January  1, 1994  -  December  31, 1994 


AMOUNT 
KtUUtib  1  bU 


PERSONNEL 

Salaries: 

Principal  Investigator: 

Ben  Knapp  *  20%  Release  Time  AY 

$10,214 

Graduate  Student  Assistants  (1): 

6  mos.  (a>  50%  @  $8.50/hr 

$4,386 

3  mos.  @  100%  @  $8.50/hr 

$4,386 

Total  Salaries 

$18,986 

Fringe  Benefits: 

Students  (2)  5% 

$439 

Release  Time  (S)  34% 

$3,473 

Total  Fringe  Benefits 

$3,911 

TOTAL  PERSONNEL 

$22,898 

EQUIPMENT 

$0 

SUPPLIES 

$1,000 

TOTAL  DIRECT  COSTS 

$23,898 

INDIRECT  COSTS  @  49%  MTDC 

$11,710 

TOTAL  PROJECT  COSTS 

$35,608 
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Appendix  A 

Progress  Report 


1 .  Overview 

A.  Development  of  Data  Parsing  Algorithm 

The  first  phase  cf  this  project  was  tc  bs  able  tc  read  the 
MGQT  data  files  received  from  the  NSA  and  separate  this  data 
into  appropriate  features  for  classification.  After 
consulting  with  the  University  of  Washington,  we  were  able 
to  develop  our  own  data  reading  program. 

After  consultation  with  experienced  polygraph  examiners  and 
a  detailed  review  of  the  polygraph  literature,  the  data 
reading  program  was  then  modified  to  parse  the  data  into  a^ 
matrix  of  features.  The  feature  set  included,  as  outlined  in 
the  project  proposal,  time  domain,  frequency  domain,  and 
correlation  domain  data.  Some  examples  of  the  feature  set 
are: 


Time  Domain  Features 

-  Mean,  curvelength,  area,  and  standard  deviation  for  all 
polygraph  channels 

-  Average  of  the  amplitudes  of  the  peaks  in  the  cardio  and 
respiratory  channels 

-  Derivative  of  the  amplitudes  of  the  peaks  of  cardio  and 
respiratory  channels 

-  Number  of  peaks  in  the  cardio  and  respiratory  channels 

-  Inhalation  amplitude/exhalation  amplitude  of  respiratory 
channels 


Frequency  Domain  Featiires 

-Fundamental  frequency  of  cardio  and  respiratory  signals 
-Coherency  and  cross  power  spectral  density  between  cardio 
and  respiratory  channels 

-Power  spectral  density  of  cardio  and  respiratory  channels 
-Integrated  power  spectral  density  for  cardio  channel 

Correlation  Domain  Features 

-  Autoregressive  parameters  (10)  for  cardio  signal 

-  Cross-correlation  between  cardio  and  respiratory  channels 

B.  Design  of  Fuzzy  Classifier  Algorithm 

Fuzzy  classifier  design  has  focused  on  the  development  of  a 
fuzzy  set  based  k  nearest  neighbor  algorithm.  The  algorithm 
learns  using  a  set  of  MGQT  data  divided  equally  between 
truthful  and  deceptive.  Since  there  were  150  deceptive 
files  and  only  50  truthful  files,  the  deceptive  files  were 
divided  into  three  sets  of  50  files  each.  The  algorithm  was 
trained  separately  for  each  data  set.  When  a  question  was 
asked  more  than  once  by  an  examiner  the  questions  were 


scored  individually  and  then  combined  at  the  end  on  a 
majority  basis.  Some  examples  of  the  results  achieved  using 
the  best  four  features  and  no  indecision  allowed  are: 


Deceptive  Sgt. 

S Correct  ^Correct 
Deceptive  Truthful 

% Correct 
Total 

1 

94  78 

86 

2 

89  72 

80 

3 

100  83 

91 

The  following  are  three  reports  which  describe  in  detail  the 
work  performed.  In  addition,  a  copy  of  a  paper  which  has 
been  submitted  to  the  IEEE  International  Conference  on  Fuzzy 
Systems  is  also  included.  Finally,  a  manual  is  included 
which  instructs  the  user  how  to  repeat  the  work  performed  at 
SJSU. 
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I.  Introduction 


A  great  amount  of  work  has  been  done  on  the  application  of  fuzzy  logic  techniques  for  pattern 
recognition.  In  this  study  some  of  the  more  important  algorithms  are  summarized  and  conq)ared. 

Pattern  recognition  could  be  defined  as  search  for  structure  in  data.  This  means  organizing  data  in 
groups  in  a  way  that  members  of  each  group  have  some  kind  of  similarity.  A  system  that  does  this  job  is 
called  a  classifier.  A  classifier  can  be  designed  by  a  human  expert  and  be  used  to  classify  the  data  (fixed 
design).  Another  approach  is  to  provide  the  classifier  with  the  data  and  make  it  adapt  itself  according  to 
the  data  that  it  receives.  Adaptive  systems  can  be  divided  into  two  main  categories,  supervised  and 
unsupervised. 

In  supervised  learning,  another  system  (or  a  human  expert)  which  is  usually  called  a  teacher, 
furnishes  the  classifier  with  the  group  that  each  data  item  belongs  to,  so  that  classifier  can  learn  from  a  set 
of  labeled  input  data  and  be  able  to  classify  new  data.  This  process  is  called  training. 

In  unsupervised  learning,  which  is  also  called  clustering,  the  system  is  given  a  set  of  unlabled  data, 
and  it  is  expected  that  it  find  internal  similarities  between  the  data  items  and  put  them  in  different  groups 
accordingly.  If  data  are  represented  quantitatively  as  vectors  in  a  vector  space,  data  that  are  spatially  close 
should  be  put  in  one  group. 

In  the  section  ,a  method  of  classification  is  described  which  uses  fuzzy  linguistic  variables.  This 
method  uses  human  experts  to  train  the  system  and  then  uses  the  labeled  linguistic  samples  to  refine  the 
classifier.  In  section  2,  C-Means  Algorithm  which  is  a  clustering  method  is  explained.  Section  3  covers 
K  Nearest  Neighbor  algorithm  which  is  a  supervised  classification  method. 


After  polygraph  files  were  decoded  and  put  in  a  directory,  they  could  be  processed  using 
Matlab.  It  was  tried  to  write  the  programs  in  a  structured  way  so  that  creating  and 
debugging  of  individual  sections  would  be  easier  and  program  segments  would  have  direct 
conformity  with  conceptual  block  diagrams.  At  the  lowest  levels,  there  are  many  Matlab 
routines  that  operate  on  pieces  of  data  and  extract  features  from  them,  and  return  these 
features  to  the  calling  routines.  At  the  top,  there  is  a  Matlab  program  that  extracts  the 
features  for  all  the  files  in  a  directory  and  saves  it  as  a  matrix.  The  structure  of  these 
programs  is  explained  in  the  following  sections. 

The  main  feature  extraction  program  is  a  Matlab  routine  called  newfeat.  This  program 
finds  the  features  for  the  files  in  a  directory  and  saves  the  features  in  a  matrix.  The  main 
part  of  the  program  is  a  loop  that  extracts  the  features  of  a  single  file  and  puts  them  in  a 
vector.  This  action  is  repeated  for  all  the  files  that  their  name  is  given.  In  order  for  the 
Matlab  program  to  find  the  files  to  processed  in  a  directory,  a  C  program  was  written  that 
searches  in  a  directory  and  saves  all  the  names  of  all  the  files  that  it  finds  in  an  ASCII  file 
containing  a  Matlab  matrix.  This  C  program  is  called  'flist'  and  could  be  found  in  the 
\polygrap\project\source  directory.  The  way  this  program  works  is  explained  below; 

/*  This  program  lists  the  files  in  a  dos  directory  and  saves  this 
listing  in  a  file  called  files.m.  This  file  is  actually  a  Matlab  script 
that  contains  a  matrix  called  'flist'  which  holds  a  filename  in  each  row. 

The  first  character  of  file  names  can  be  given  to  this  program  as  an 
input  argument. 

Ex; 

flist  t 

is  equal  to  use  the  dos  command 
dir  t*.* 

and  save  the  result  in  a  Matlab  m  file  called  files.m 


*/ 

After  running  the  flist.exe  program  in  a  directory,  and  checking  that  the  appropriate 
filenames  are  saved  in  the  files.m  file,  the  Matlab  program  can  use  them  by  executing  the 
command 
files 

and  using  the  variable  flist. 

Another  important  data  item  that  is  used  in  the  feature  extraction  programs  is  called 
featurejist.  It  is  a  Matlab  matrix  that  includes  the  names  of  feature  extraction  routines. 
In  each  row  of  the  feature  jist  matrix  a  feature  extraction  routine  is  named  along  with  the 
channel  number(s)  that  this  routine  will  be  applied  to.  For  example 


lOmean(frag)' 


means  to  apply  the  mean  function  to  a  piece  of  data  called  frag,  which  is  defined  later. 

The  channel  that  data  is  to  be  gathered  from  is  channel  1 .  As  another  example 

'26crosscor(frag,  fragS)' 

means  to  apply  the  fiinction  crosscor  to  two  pieces  of  data  coming  from  channels  2  and  6, 
in  variables  called  frag  and  frag3. 

featurejist  is  defined  in  newfeat  program.  All  the  features  that  are  extracted  from  the 
data  are  listed  in  it.  If  a  new  feature  is  to  be  investigated,  it  is  enough  to  write  a  program 
that  extracts  it,  and  add  that  program  name  in  this  list. 

Note;  It  is  highly  recommended  that  the  programs  newfeat,  feature,  and  processf  be  read 
carefully  before  making  any  changes  in  feature  list. 

Before  being  able  to  do  any  processing  on  the  data,  for  each  data  file  another  file  should 
be  created  that  holds  the  types  of  the  questions.  These  files  are  named  zz/iflme.0x4.  Note 
that  these  files  are  not  a  standard  part  of  axciton  files  and  were  created  here  by  referring  to 
the  question  files  and  data  sheets  that  accompanied  each  the  files.  The  format  of  these 
files  is  as  follows: 

X  0  0  0  0 

al  bl  cl  dl  el 

a2  b2  c2  d2  e2 

a3  b3  c3  d3  e3 

X  is  either  one  or  zero.  1  means  the  file  is  deceptive,  and  zero  means  it  is  non-deceptive. 
The  rows  2,  3  and  4  in  this  file  show  the  numbers  of  relevant,  irrelevant,  and  control 
questions.  For  example  for  a  deceptive  file  in  which  questions  3,  5,  8  and  9  are  relevant, 
questions  1,  2,  4,  and  7  are  irrelevant,  and  questions  6  and  10  are  control,  a  question  file  is 
constructed  that  looks  like  this: 

1  0  0  0  0 

3  5  8  9  0 

1  2  4  7  0 

6  10  0  0  0 


The  newfeat  program,  for  each  data  file  which  is  listed  in  flist,  loads  the  above  mentioned 
question  file  to  find  the  question  types.  Then  it  calls  the  actual  feature  extraction  routine 
which  is  called  feature.  The  program  feature  finds  all  the  features  for  each  relevant, 
irrelevant,  and  control  question  and  returns  the  results  in  a  vector.  This  vector  is  added  as 
a  new  column  to  a  matrix  called  M.  At  the  end  of  the  newfeat  program  the  matrix  M  is 
saved  in  a  file.  This  file  is  manipulated  by  another  program  called  processf. 


processf  is  a  program  that  loads  the  M  matrix,  combines  the  features  for  each  question  in 
different  ways  that  are  explained  in  reports  of  Mitra  and  Shahab,  and  saves  the  resultant 
matirx,  the  F  matrix,  in  a  file. 

The  above  procedure  was  repeated  for  the  polygraph  files  in  several  directories.  One  of 
the  directories  contained  files  for  non-deceptive  cases  and  the  other  ones  included 
deceptive  files.  Three  sets  of  data  were  built  by  combining  the  features  for  non-deceptive 
cases  with  three  sets  of  deceptive  files.  Each  data  set  contained  50  deceptive  and  50  non- 
deceptive  cases.  These  sets  were  used  by  classification  programs. 


Classification: 

There  are  two  classifier  programs  written  for  this  project,  fknn  and  cknn,  which  implement 
fuzzy  and  crisp  K-nearest  neighbor  classifiers  accordingly.  These  programs  are  written  in 
C  programming  language.  The  way  they  interact  with  Matlab  is  through  reading  and 
writing  files  in  Matlab  format,  that  is  .mat  files.  There  are  two  C  functions  inside  these 
programs  called  loadmat  and  savemat  which  are  interfaces  to  Matlab  files  and  can  be  used 
to  load  and  save  date,  which  in  Matlab  are  matrices,  from  Matlab  files.  These  two 
functions  are  in  a  file  called  matldsv.c  which  should  be  compiled  with  the  source  files  that 
use  them,  fknn  and  cknn  programs  load  matrices  that  include  the  features  and  were 
prepared  by  Matlab  feature  extraction  routines.  After  loading  the  matrices,  the  feature 
vectors  in  test  matrix  are  classified  individually,  and  the  result  is  saved  in  a  file  as  a  Matlab 
matrix.  The  comments  in  the  source  codes  of  the  programs  cknn  and  fknn  are  repeated 
here  for  reference; 

/*  cknn;  This  program  implements  a  K-nearest  neighbor  classifier. 


The  main  program  opens  a  Matlab  data  file,  reads  the  training  matrix, 
classifies  each  entry  in  the  testing  matrix,  and  writes  the  result  in  an 
output  file.  The  file  that  this  program  gets  the  information  from  should  be 
called  "cdatafil.mat".  As  the  name  implies  it  is  in  Matlab  file  format. 

The  data  in  this  file  should  have  the  following  order; 

1.  A  single  variable  'C  which  is  the  number  of  classes. 

2.  A  single  variable  'K'  which  is  the  parameter  'K'  in  K-NN  Algorithm. 

3.  A  training  matrix  'P'  which  contains  a  set  of  feature  vectors.  Each  vector 
is  in  a  column  of  the  matrix. 

4.  A  classes  vector 'T'  which  contains  the  classes  of  the  training  set 

5.  An  input  matrix  'U'  which  contains  a  set  of  unclassified  feature  vectors. 

The  main  program  uses  the  Crisp  KNN  routine  to  classify  each  one  of  the  input 
vectors  and  saves  the  results  (the  classes  that  these  inputs  belong  to)  in  a 
file  called  coutfile.mat.  This  file  is  in  Matlab  format.  This  file  contains 
a  vector  of  the  classes  called; 


'cresult' 


This  program  can  be  called  from  dos,  or  within  Matlab  by  using  dos  escape 
character An  example  Matlab  script  file  that  shows  how  this  program  can 
be  used  is  included  in  the  file  "cknntest.m". 

*/ 

/*  fknn;  This  program  implements  a  fiizzy  version  of  K-nearest  neighbor  classifier. 

The  main  program  opens  a  Matlab  data  file,  reads  the  training  matrix, 
classifies  each  entry  in  the  testing  matrix,  and  writes  the  result  in  an 
output  file.  The  file  that  this  program  gets  the  information  from  should  be 
called  "fdatafile.mat".  As  the  name  implies  it  is  in  Matlab  file  format. 

The  data  in  this  file  should  have  the  following  order; 

1.  A  single  variable  'C  which  is  the  number  of  classes. 

2.  A  single  variable  'K'  which  is  the  parameter  'K'  in  K-NN  Algorithm. 

3.  A  single  variable  M'  which  is  the  coefficient  in  fuzzy  algorithm. 

4.  A  training  matrix  'P'  which  contains  a  set  of  feature  vectors.  Each  vector 
is  in  a  column  of  the  matrix. 

5.  A  class  membership  matrix  T  which  contains  the  membership  values  of  the 
training  set  vectors  to  the  classes. 

6.  An  input  matrix  'U'  which  contains  a  set  of  unclassified  feature  vectors. 

The  main  program  uses  the  Fuzzy  KNN  routine  to  classify  each  one  of  the  input 
vectors  and  saves  the  results  (the  memberships  of  the  inputs  to  classes)  in  a 
file  called  "foutfile.mat".  This  file  is  in  Matlab  format.  This  file  contains 
a  single  variable  called  fresult.  It  is  a  matrix  of  the  memberships  of  the 
inputs  to  the  classes. 

This  program  can  be  called  from  dos,  or  within  Matlab  by  using  dos  escape 
character '!'.  An  example  Matlab  script  file  that  shows  how  this  program  can 
be  used  is  included  in  the  file  "fknntest.m". 

*/ 

As  mentioned  above,  the  programs  fknn  and  cknn  are  the  actual  classifiers  which  can  be 
called  directly  from  dos  or  within  a  Matlab  program.  Several  Matlab  programs  were 
written  that  used  these  two  programs  for  classification  of  data.  The  Matlab  programs 
acted  mostly  as  a  front  end  or  user  interface  for  the  classifier  programs.  A  listing  of  many 
Matlab  programs  and  functions  is  included  as  an  appendix  in  this  report.  Understanding  of 
all  the  functions  is  not  necessary  because  they  are  used  inside  the  programs.  Some  of  the 


programs  were  created  to  test  other  programs  or  to  experiment  with  the  data.  These 
programs  are  not  necessary  for  classification,  but  knowing  about  them  might  help  to 
prevent  recoding  routines  that  are  already  there.  In  the  case  of  user  interface  programs, 
the  best  way  is  to  run  them  and  become  familiar  with  the  way  they  work.  They  were 
intended  to  be  very  flexible,  and  usually  by  changing  a  few  parameters  inside  the  code, 
they  can  be  used  for  other  purposes. 

Classifier  programs  were  used  not  only  to  classify  a  given  data  set,  but  also  to  select  a  set 
of  good  features  from  all  the  features  that  initially  were  tried.  For  a  detailed  discussion  of 
the  steps  involved  in  this  refer  to  Shahab's  and  Mitra's  reports.  Some  of  the  programs  and 
data  files  which  were  used  or  produced  in  this  stage  are  explained  here: 

Classify  is  a  Matlab  program  that  loads  a  feature  matrix  from  a  .mat  file,  randomly  breaks 
it  into  a  set  of  training,  and  a  set  of  testing  feature  vectors,  classifies  every  entry  in  the 
testing  set  using  all  the  entries  in  the  training  set  by  calling  either  fknn  or  cknn  programs, 
repeats  this  process  a  number  of  time,  and  returns  the  result  of  classification  of  each  file 
and  the  percentage  of  correct  classification  and  a  performance  index  for  the  classification. 
Some  of  the  parameters  like  the  filename  to  load  can  be  changed  inside  the  program 
classify.m.  Other  parameters  can  be  changed  while  the  program  is  running.  This  program 
is  extremely  useful  for  experimenting  with  combinations  of  features,  and  even  includes  an 
option  to  plot  the  scattering  of  the  first  two  features. 

Note;  By  setting  percent_training=l.  The  testing  and  training  sets  wont  be  randomly 
selected,  instead,  all  the  entries  except  one  are  used  in  the  training  set  and  that  entry  is 
classified.  This  action  is  repeated  for  all  the  entries  in  the  matrix. 

Clas_aut  is  an  automated  version  of  classify  program.  Instead  of  asking  the  user  for 
entering  parameters,  this  program  includes  a  loop  that  checks  the  classifications  using  all 
the  features  individually.  The  results  are  saved  in  a  file  called  clas_res.  All  the  other 
parameters  should  be  set  in  the  program.  It  should  be  noted  that  running  this  program 
might  take  a  long  time  depending  on  the  number  of  features  and  repetitions.  Clasaut2, 
clasaut3,  and  clasaut4  are  alterations  of  clas_aut  that  instead  of  using  single  features  use 
combinations  of  2  to  4  features.  Clasaut2  tries  all  the  pairwise  combinations  of  the 
features.  ClasautS  and  clasaut4  use  the  combinations  of  2  and  3  features  supplied  to  them 
in  the  program  and  combine  them  with  other  features  to  test  the  combinations  of  3  and  4 
features. 

bestfk  is  a  Matlab  script  that  sorts  the  features  according  to  their  performance  in 
classifying  the  files.  Note  that  the  correct_classification  vectors  for  data  sets  1-3  were 
saved  as  resl,  res2,  and  res3  in  a  file  called  Knn-res.  This  file  is  loaded  by  the  bestfk 
program.  The  best  features  are  found  for  the  three  data  sets.  For  more  details  about  the 
selection  strategy  refer  to  Shahab's  report.  It  is  informative  to  look  at  the  program  code 
to  find  out  about  the  outputs  that  it  produces. 

Bestfs  is  the  same  program  as  bestfk.  The  only  difference  is  that  it  loads  the  results  from  a 
file  called  scat_res.  This  file  is  produced  by  saving  the  results  of  the  scatter  criterion. 


Scat  is  a  Matlab  function  that  finds  the  scatter  criterion  for  the  feature  vectors  in  a  matrix. 
It  was  used  for  the  feature  matrices  of  sets  1-3,  and  the  results  were  saved  in  scat_res. 
BestfZ,  bestf3,  and  bestf4  work  the  same  way  as  bestfk,  but  as  output  give  the  best 
combinations  of  2-4  features. 


Appendix:  A  listing  of  the  Matlab  programs 


best£2: 

This  Matlab  script  finds  the  best  30  combinations  of  features  from  three 
sets  of  features.  Same  features  are  tried  on  3  sets  of  data. 

This  is  used  to  rank  the  combinations  of  2  features 


bestf3 : 

Same  as  bestfZ,  but  for  combinations  of  3  features. 


bestf4: 

Same  as  bestfZ,  but  for  combinations  of  4  features. 


bestfs: 

This  Matlab  script  tries  a  method  to  find  the  best  30  features  from  three 
sets  of  features.  Same  features  are  tried  on  3  sets  of  data.  Scatter  criterion  is 
used  to  measure  each  feature's  performance. 

Note  that  for  the  features  1-651  each  set  of  seven  features  are  in  fact  the  same 
feature  combined  differently  for  different  features.  For  the  rest  of  the  features 
i.e.  652-669  each  set  of  three  is  the  same  feature. 


bestfk: 

This  Matlab  script  tries  a  method  to  find  the  best  30  features  from  three 
sets  of  features.  Same  features  are  tried  on  3  sets  of  data.  The  results  of 
classification  using  a  KNN  classifier  is  saved  on  a  vector  called  correct_res. 
Note  that  for  the  features  1-651  each  set  of  seven  features  are  in  fact  the  same 
feature  combined  differently  for  different  features.  For  the  rest  of  the  features 
i.e.  652-669  each  set  of  three  is  the  same  feature. 


clas_aut 

This  program  adds  a  loop  to  the  classify  program.  It  repeats  classification 
for  different  input  vectors.  It  saves  the  results  (percentage  correctly 
classified  and  performance  index) 
as  two  vectors  in  a  file  called  clas  res. 


clasaut2: 

This  program  adds  a  loop  to  the  classify  program.  It  repeats  classification 
for  different  combinations  of  2  features.  It  saves  the  results  (percentage  correctly 
classified  and  performance  index)  and  the  indexes  of  these  features 
in  a  file  called  clasres2. 


clasaut3: 

same  as  clasaut2,  but  for  the  combinations  of  3  features. 


clasaut4: 

same  as  clasaut4,  but  for  the  combinations  of  4  features. 


classify: 

This  script  parses  a  matrix  of  polygraph 
vectors  into  training  and  testing  vectors. 
It  then  calls  the  classifier,  trains,  tests 
and  gives  results. 


cluster! : 

This  is  a  program  that  tests  the  K-Nearest-Neighbor 
algorithm  with  a  set  of  two  class  data  that  have 
gaussian  distribution 


cluster2: 

Another  program  like  cluster! . 


feattst; 

An  older  version  of  classify. 


feattst2: 

Another  version  of  feattst. 


featurev: 

Mitra 


plotf 

This  script  prompts  the  use  to  enter  two  features  and  plots  them. 


randvect: 

function  [y,x]  =  randvect(elements, maximum) 

This  function  creates  a  vector 
of  random  numbers  between  ! 
and  the  maximum  number  given 
to  the  function  (maximum). 

The  length  of  the  vector  is 
specified  by  the  number  of 
elements  given  to  the  function, 
e.g.  randvect(elements,maximum) 


scat: 

function  J=scat( Sample,  Class) 


J=scat(Saniple,  Class) 

returns  a  value  that  shows  how  the  labeled  samples  of  a  two  class  distribution 
are  scattered.  Samples  is  a  vector  that  contains  the  values  of  the  samples. 
Class  is  a  vector  that  contains  the  class  labels(0  or  1). 

The  criterion  function  is: 

J=(ml-m2)^2  /  sl''2+s2^2 

m's  are  the  means  for  the  classes  and  S's  are  scatters  of  samples. 

Larger  result  means  better  separation  between  the  classes. 

Reference;  Pattern  Classification  and  Scene  Analysis,  Duda  and  Hart 


scatv 

function  JV=scatv(M,  Class) 

scatv  returns  a  vector  that  contains  the  scatter  criterion  of  a  matrix, 
each  row  of  the  matrix  M  contains  values  of  the  samples  for  one  feature. 
Class  is  the  class  labels  for  the  samples, 
see  also  scat 
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§1.  ABSTRACT 


Polygraph  tests  are  a  widely  used  method  to  distinguish  between  truth  and  deception. 
During  a  polygraph  test,  the  subject  is  asked  a  series  of  control,  relevant  and  irrelevant 
questions  which  provide  different  physiological  responses  useful  for  a  comparison.  The 
three  physiological  responses  that  are  currently  measured  are  Electrocardiogram,  Galvanic 
Skin  Response  (GSR)  and  Respiration. 

Polygraph  charts  are  usually  analyzed  by  human  interpreters.  However,  computer 
algorithms  are  now  being  developed  to  score  the  tests  or  verify  the  results.  These  methods 
are  based  on  statistical  classification  techniques. 

In  this  study  two  different  fuzzy  algorithms  were  implemented  to  classify  the  polygraph 
charts,  using  a  number  of  time,  frequency  and  correlation  domain  features.  These  two 
algorithms  and  their  results  were  then  compared  with  those  from  the  previous  works.  The 
major  advantage  of  using  fuzzy  set  theory  is  that  it  does  not  simply  assign  each  input  to 
one  of  the  clusters,  but  it  gives  a  degree  of  belonging  of  an  input  to  each  cluster. 

The  average  correct  detection  rate  we  achieved  in  this  study  was  80%  -  85%.  Using 
certain  set  of  data  we  even  obtained  up  to  97%  correct  detections. 
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§2.  INTRODUCTION 


2.1.  POLYGRAPH^ 


2. 1. 1.  Preview: 


Polygraph  examinations  are  the  most  widely  used  method  to  distinguish  between  truth  and 
deception.  In  a  Polygraph  examination  a  person  is  connected  to  a  special  instrument  called 
a  Polygraph  which  records  several  physiological  signals  such  as  blood  pressure,  Galvanic 
Skin  Response,  and  respiration.  The  subject  is  asked  a  set  of  questions  by  an  examiner.  By 
looking  at  these  signals  the  examiner  is  able  to  determine  the  reactions  of  the  subject  to 
the  questions  and  decide  whether  the  person  was  truthful  or  deceptive  in  answering  each 
question. 

The  problem  with  human  classification  of  Polygraph  tests  is  that  the  outcome  depends  on 
the  examiner's  experience  and  personal  opinion.  Automatic  scoring  of  Polygraph  tests  has 
been  a  subject  of  extensive  research.  Several  methods  for  Polygraph  classification  have 
been  studied  which  are  mostly  based  on  statistical  classification  techniques. 

Digitized  Polygraph  data  used  in  this  project  were  collected  from  various  police  stations. 
The  data  files  were  organized  according  to  the  test  format  used  and  were  decoded  to 
ASCII  format  so  they  can  be  read  by  Matlab.  Preprocessing  and  feature  extraction 
routines  were  implemented  in  the  Matlab  language  in  privious  works  [Layeghi  1993,1] 
[Dastmalchil993][Jacobsl993].  Three  sets  of  files  were  chosen,  each  one  of  them 
contained  50  deceptive  and  50  non-deceptive  files. 

These  files  are  listed  in  the  appendix,  Fig.42. 

2.1.2.  History: 


The  first  attempt  to  use  a  scientific  instrument  in  an  effort  to  detect  deception  occurred 
around  1895  [Reid  1966].  That  was  the  year  that  Caesar  Lombroso  published  the  results  of 
his  experiments  in  which  a  hydrosphygmograph  was  used  to  measure  the  blood  pressure- 
pulse  changes  of  criminals  in  order  to  determine  whether  or  not  they  were  deceptive. 
Although  the  hydrosphygmograph  was  originally  intended  to  be  used  for  medical 

^Portions  of  this  section  were  extracted  from  [Layeghil993,l]  using  particularly  [Cappsl992]  [01senl983] 
[Reidl966]. 
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purposes,  Lombroso  found  that  it  worked  well  for  lie  detection.  Lombroso  may  have  been 
the  first  to  use  a  peak  of  tension  test  format.  This  was  done  by  showing  a  suspect  a  series 
of  photographs  of  children,  one  being  the  victim  of  sexual  assault.  If  the  suspect  did  not 
react  more  to  the  victims  picture  than  the  pictures  of  the  other  children,  Lombroso 
concluded  that  the  suspect  did  not  know  what  the  victim  looked  like  and  therefore  was  not 
the  alleged  perpetrator. 

In  1914  Vittorio  Benussi  published  his  research  on  predicting  deception  by  measuring 
recorded  respiration  tracings  [Cappsl992].  He  found  that  if  the  length  of  inspiration  were 
divide  by  the  length  of  expiration,  the  ratio  would  be  larger  after  lying  than  before  lying 
and  also  before  telling  the  truth  than  after  telling  the  truth.  In  1921  John  A.  Larson 
constructed  an  instrument  capable  of  simultaneously  recording  blood  pressure  pulse  and 
respiration  during  an  examination  [Reid 1966]  [Capps  1992].  Larson  reported  accurate 
results  which  prompted  Leonarde  Keeler  to  construct  a  better  version  of  this  instrument  in 
1926  [Reid  1966]  [Capps  1992]. 

The  use  of  galvanic  skin  response  in  lie  detection  began  during  the  turn  of  the  century.  It's 
usefulness,  however,  did  not  become  evident  until  the  1930's  during  which  time  several 
articles  written  by  Father  Walter  G.  Summers  of  Fordham  University  in  New  York 
[Cappsl992].  In  these  articles  he  reports  over  90  criminal  cases  in  which  examination 
using  the  galvanic  skin  response  had  all  been  successful  and  confirmed  by  confession  or 
supplementary  evidence. 

The  usefulness  of  the  galvanic  skin  response  prompted  Keeler  to  add  an  galvanometer  to 
his  polygraph.  At  the  time  of  Keelers  death  in  1949,  the  Keeler  Polygraph  recorded  blood 
pressure-pulse,  respiration,  and  galvanic  skin  response  [Reid  1966]. 


2.1.3.  Modern  Test  Formats: 

The  effectiveness  of  a  polygraph  examination  is  often  the  result  of  the  test  format  that  is 
used.  A  polygraph  test  format  consists  of  an  ordered  combination  of  relevant  questions 
about  an  issue,  control  questions  that  provide  a  physical  response  for  comparison,  and 
irrelevant  questions  that  also  provide  a  response  or  the  lack  of  a  response  for  comparison 
[01senl983][Cappsl992]. 
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Three  general  types  of  test  formats  are  in  use  today.  These  are  Control  Question  Tests, 
Relevant-Irrelevant  Tests,  and  Concealed  Knowledge  Tests.  Each  of  the  general  test 
formats  may  have  a  number  of  more  specific  variations.  Each  examination  consists  of  two 
to  five  sessions  containing  a  prescribed  series  of  questions.  The  test  format  that  is  used  in 
an  examination  is  determined  by  the  test  objective  [Reid 1966]  [Capps 1992]. 

1.  The  Concealed  Knowledge  Test,  also  called  peak  of  tension  test,  is  used  when  facts 
about  a  crime  are  known  only  by  the  investigators  and  not  by  the  public.  In  this  case,  a 
subject  would  not  know  the  facts  unless  he  or  she  was  guilty  of  the  crime.  For  example,  if 
a  gun  was  used  in  a  crime  and  the  public  did  not  know  the  caliber ,  an  examiner  could  ask  a 
suspect,  if  it  was  a  22  caliber,  a  38  caliber,  or  a  9  mm.  If  the  gun  used  was  a  9  mm  and  the 
suspect  was  deceptive,  a  polygraph  chart  would  probably  indicate  evidence  of  deception. 

2.  A  Control  Question  Tesfi  is  often  used  in  criminal  investigations.  In  this  type  of  test  a 
series  of  relevant,  irrelevant,  and  control  questions  are  asked: 

•  A  relevant  question  is  one  which  is  specific  to  the  crime  being  investigated. 

For  example,  "Did  you  steal  the  money?''. 

•  A  control  question  is  designed  to  make  the  subject  feel  uncomfortable.  It 
is  not  specific  to  the  crime  being  investigated  however  it  may  be  related  in 
an  indirect  way.  A  control  question  that  could  follow  the  relevant  question 
stated  above  is  "Have  you  ever  taken  anything  that  did  not  belong  to  you?". 

The  control  questions  are  compared  to  the  relevant  questions  and  if  the 
responses  to  the  relevant  questions  are  greater,  the  subject  is  usually  classified 
as  deceptive. 

•  Irrelevant  questions  are  used  as  buffers.  Examples  of  irrelevant  questions  are 
"Are  the  lights  in  this  room  on?"  or  "Is  today  Monday?". 

3.  Relevant-Irrelevant  Tests  are  usually  used  to  test  people  trying  to  obtain  security 
clearance  or  get  a  job.  In  this  test,  relevant  questions  are  compared  to  irrelevant  questions. 
Very  few  control  questions  are  asked.  The  purpose  of  control  questions  in  this  test  is  to 
make  sure  that  the  subject  is  capable  of  reacting  at  all. 


2  It  was  decided  to  use  this  method  in  our  project  (as  it  was  also  in  previous  works). 
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2. 1. 4.  Present  Day  Equipment 


The  most  popular  polygraph  machines  today  are  the  Reid  Polygraph  developed  in  1945 
and  the  Axciton  Systems  computerized  polygraph  developed  in  1989  [01senl983].  The 
Reid  polygraph  scrolls  a  piece  of  paper  under  pens  that  record  the  biological  signals.  The 
Axciton  polygraph  digitizes  physiological  signals  and  uses  a  computer  to  process  them. 
The  sampling  frequency  of  the  Axciton  machine  is  30  Hz.  Axciton  provides  a  computer 
based  system  for  ranking  the  subject  responses  but  allows  printouts  of  the  charts  to  be 
scored  by  hand  the  traditional  way. 

Both  machines  record  the  same  biological  signals  using  standard  methods.  Blood  pressure 
is  measured  by  placing  a  standard  blood  pressure  cuff  on  the  arm  over  the  brachial  artery. 
Respiration  is  monitored  by  placing  rubber  tubes  around  the  abdominal  area  and  the  chest 
of  the  subject.  This  results  in  two  signals,  a  lower  and  upper  respiratory  signal.  Skin 
conductivity  is  measured  by  placing  electrodes  on  two  fingers  of  the  same  hand. 

The  focus  of  this  thesis  is  to  investigate  two  different  fuzzy  pattern  recognition  algorithms 
using  the  aforementioned  signals. 
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2.2.  PATTERN  RECOGNITION  UTILIZING  FUZZY  TOOLS 


2.2.1.  Why  the  "FUZZY"  approach? 

While  observing  the  history  of  science,  we  notice  that  one  of  its  major  goals  has  always 
been  what  we  call  today  "pattern  recognition".  Having  this  in  mind,  man  created  models, 
functional  relationships  and  mathematical  tools  to  come  closer  to  a  perfect  and  precise 
model  for  almost  every  area  of  the  nature  and  our  being.  In  fact,  "precision"  became  more 
and  more  important,  to  the  extent  that  an  imprecise  model  was  a  bad  model  by  default. 

1965  Lotfi  A.  Zadeh  introduced  in  his  innovative  paper  [Zadehl965]  an  "imprecise" 
structure  for  mathematical  observation;  Hence,  the  fuzzy  set  was  bom.  A  companion  to 
the  classical  one  with  often  more  useful  and  suitable  representation  of  our  environment. 

"The  fuzzy  set  was  conceived  as  a  result  of  an  attempt  to  come  to  grips  with  the  problem 
of  pattern  recognition  in  the  context  of  imprecisely  defined  categories.  In  such  cases,  the 
belonging  of  an  object  to  a  class  is  a  matter  of  degree,  as  is  the  question  of  whether  or  not 
a  group  of  objects  form  a  cluster";  These  were  the  introductory  words  from  L.A.  Zadeh  in 
[Bezdekl981].  They  summarize  the  fundament  of  any  fuzzy  clustering  or  classifying 
algorithm  concerning  any  search  of  data  structure  or  pattern  recognition.  This  concept  is 
exactly  what  this  project  is  all  about. 

An  example; 

Imagine,  you  have  two  groups  of  objects  "chairs"  and  "desks"  in  different  varieties.  In  a 
simple  version  of  a  typical  pattern  recognition  problem,  you  have  the  task  to  cluster  or 
classify  the  given  objects  into  these  two  groups.  In  reality,  we  will  also  have  other  objects 
like  a  big  box  or  a  bed  within  the  pool  of  the  objects,  but  only  the  two  aforementioned 
clusters  by  definition.  Now,  a  conventional  crisp  clustering  method  would  put  these 
critical  objects  in  either  one  of  these  two  clusters.  Thus,  the  big  box  or  the  bed  may  be 
labeled  as  if  they  would  be  chars. 

A  fuzzy  clustering  method  would  label  the  objects  with  soft  membership  values.  In  this 
case,  a  big  box  (that  can  be  used  as  a  chair  or  a  desk)  might  be  labeled  with  0.6  degree 
chair  and  0.4  desk.  Information  like  this  serves  a  useful  purpose  -  "fuzzy  memberships  in 
several  classes  are  a  signal  to  take  a  second  look"  [Bezdekl993]  [Bezdekl992]; 
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Hard  memberships  of  data  cannot  support  this.  Thus,  the  fuzzy  model  provides  a  richer 
and  more  flexible  solution  structure,  one  that  models  the  real  objects  with  a  finer  degree  of 
detail  than  the  harshness  of  the  crisp  models.  Notice  also  that  hard  membership  values 
build  a  subset  of  the  fiizzy  membership^  set. 

There  are  different  types  of  fuzzy  algorithms  to  find  the  appropriate  membership  values 
within  the  data.  In  this  project,  we  used  the  follwoing  two  approaches: 

1.  Clustering  algorithms: 

Given  any  finite  data,  the  problem  of  clustering  is  to  find  similarities  between  the  objects 
of  the  data  and  to  assign  labels  that  matching  objects  would  belong  to  the  same  subgroups. 
The  algorithm  starts  its  search  without  any  initial  interpretative  information  about  the  data 
elements.  It  only  seeks  for  objective  numerical  similarities  between  the  elements.  Because 
the  initial  objects  are  unlabeled,  this  method  is  often  called  "unsupervised  learning".  The 
word  learning^  implies  that  the  clustering  algorithm  will  ultimately  find  the  correct  labels 
at  the  end  of  the  process.  This  is  what  we  hope  to  obtain,  but  we  do  not  know  it  a  priori. 

Notice  that  because  of  the  unsupervised  nature  of  this  algorithm,  we  may  find  "correct" 
clusters  which  represent  some  similarities,  but  not  the  ones  we  were  looking  for.  In  the 
aforementioned  example  with  chairs  and  desks,  the  algorithm  may  provide  two  clusters  of 
"wood-made"  and  "metal-made"  objects  (which  are  also  correct),  but  not  "chairs"  and 
"desks"  as  we  had  hoped  for. 

In  this  case,  the  performance  of  a  clustering  model  is  influenced  by  the  choice  of  the 
parameters^,  features,  geometrical  properties  and  our  eventual  interpretation  of  the  labels. 

2.  Classifying  algorithms: 

In  contrast  to  a  clustering  system  which  labels  a  given  data,  a  classifier  is  capable  -  once  it 
is  defined  (and  trained)  -  of  labelling  every  appropriate  data.  In  addition,  a  classifying 
system  is  ususally  initialized  by  labeled  objects.  In  these  cases,  we  call  this  method 
"supervised  learning". 


^Notice  that  membership  values  are  not  probabilities;  they  are  similarities  of  object  vectors  to  a  class 
structure.  They  represent  the  degree  of  belonging  of  an  object  to  a  group  of  objects. 

^The  word  learning  does  not  imply  any  training.  In  fact,  a  clustering  system  -  as  is  its  nature  -  is  almost 
the  opposite  of  any  system  which  learns  by  training. 

^See  chapter  2.2.3.2.  for  the  meanings  of  the  parameters  and  chapter  3. 1.3.3.  for  the  strategies  we  used. 
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Notice  that  we  can  also  use  a  clustering  algorithm  as  a  modified  classifying  algorithm: 

After  having  set  the  optimal  combination  of  parameters  and  features,  we  can  use  the 
clustering  system  to  classify  any  new  data  by: 

•  adding  the  new  element  to  a  given  and  already  correct  clustered  data,  and  letting 
the  system  relabel^  the  data.  Thus,  our  new  object  ends  up  to  be  in  one  of  the 
clusters  representing  its  identity, 

•  saving  all  the  parameters,  cluster  centers  and  the  data  elements  and  calculate 
appropriately  the  membership  value  of  the  new  object,  which  will  eventually 
represent  its  identity. 


^Running  a  new  clustering  process  with  one  more  element  will  probably  change  the  structure  of  the 
original  clusters,  because  the  cluster  centers  and  the  membership  values  of  each  element  depend  on  all  of 
the  members.  In  spite  of  this  fact,  we  will  be  able  to  classify  a  normal  ( =  not  an  outlier )  object  by  having 
a  large  number  of  already  clustered  objects  in  a  stable  condition. 
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2.2.2.  Why  fuzzv-c-means  (FCM)? 


One  of  the  most  significant  characteristic  of/wzg^-c-means  algotithm  is  its  fuzziness  as 
the  name  assumes.  Unlike  crisp  clustering  methods,  FCM  gives  us  "membership  functions 
c  [0,  1]  which  determine  the  grade  of  belongingness  of  the  elements  to  a  cluster.  As 
mentioned  before,  this  information  is  totally  lost  by  conventional  clustering  techniques. 
The  advantage  of  FCM  is  the  fact  that  the  results  we  may  get  from  a  crisp  clustering 
method  are  automatically  within  those  fi'om  FCM. 

We  chose  FCM  as  an  alternative  and  a  comparison  to  the  fiizzy  K-Nearest-Neighbor 
algorithm  (KNN)  investigated  previously  [Layeghil993,l][Dastmalchil993][Jacobsl993], 
specially  because  FCM  is  an  unsupervised  clustering  method  which  works  only  by  using 
"mathematical"  tools  such  as  spatial  distances  or  similarities,  without  any  training  or 
additional  interpretative  information. 

By  this  method,  good^  features  will  then  hopefully  provide  an  optimal  mathematical 
grouping  that  presents  in  some  sense  an  accurate  portrayal  of  natural  structures  in  the 
physical  process  from  where  the  polygraph  data  are  drived. 


Whv  we  chose  FCM  aleorithm: 

Because  it 

• 

does  not  need  previous  training, 

• 

does  not  make  any  assumption  about 
the  distribution  of  samples, 

• 

is  unsupervised,  objective  and  self  organized. 

• 

can  be  used  as  an  alternative  and  a  comparison 
to  fuzzy  KNN  investigated  previously. 

1  - - - - - 

Fig.l:  FCM  characteristics 

^See  chapter  2. 1 . 1 .  for  characteristics  of  a  fuzzy  approach. 

*"Good"  features  are  in  our  study  those  which  can  cluster  the  data  in  deceptive  and  truthful  groups. 
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r  Fiizxv-c-means  algorithm  and  its  interpretation 


2.2. 3. 1.  FCM  code  -An  iterative  procedure: 


The  fuzzy-c-means  algorithm^  is  basically  an  iterative  procedure  to  minimize  an  objective 
function  representing  a  spatial  fiizzy  distance  between  data  points  and  cluster 
centers  v,- .  In  this  project,  I  chose  the  most  widely  used  Euclidean  distance,  i.e.  the  sum  of 

the  squared  errors  performance  index; 


n 


2 

A 


k=\  /=1 


•  X  =  {xi,X2,...,x„  )  c  9?*  is  a  finite  data  set  in  the  pattern  space 

•  c  is  a  fixed  and  known  number  of  clusters  (here:  c=2). 

•  U  =  [W/J  e  91  is  a  fuzzy  c-partition  of  X,  w,*  is  referred  to  as  the  grade  of  membership 
of  Xk  to  the  cluster  i.  satisfy  the  following  constrmnts; 

€[0,1];1</<c,1<A:<« 

C 

1  =  1 

n 

<n;l  </<c 

k=\ 


*  V  =  (  Vi,V2,...,v^  )  e91‘^  ;  each  v,-  e91*  represents  a  prototype  of  class  i. 

•  m  is  the  weighting  exponent  and  represents  the  level  of  fuzziness;  1  <  /n  <  oo . 


9[Ruspinil969]  was  the  first  one  who  suggested  the  structure  of  fuzzy-c-partition  spaces.  The  fuzzy-c- 
means  algorithm  (originally  ISODATA)  was  initially  developed  by  [Dunnl974]  and  generaUzed  by 

[Bezdekl973].  . 

Dunn  extended  and  developed  the  classical  "within-groups  sum  of  the  squared  errors"  (WGSS)  funcUon  to 
a  fuzzy  clustering  criterion  and  developed  the  fuzzy-c-means  clustering  algorithm  to  minimize  the 
objective  function  through  an  iterative  method.  Bezdek  further  extended  the  fuzzy  objective  function 
proposed  by  Dunn  to  a  more  genral  form  of  fiizzy  clustering  criterion  by  introducing  the  weighting 
exponent  m,  1  <  m  <  oo.  It  turns  out  that  Dunn's  function  is  a  special  case  ( m=2  )  of  an  infinite  family  of 
objective  functions. 
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is  an  inner  product  induced  norm  on  5R*. 


By  differentiation 
fixed,  we  obtain 


J^iU ,v)  with  respect  to  where  v,-  is  fixed  and  to  where  U  is 


^ik  = 


1 


I 


>1 


k-v,f 

h-.f 


and 

v=-!^ - . 

!(%)” 

k=\ 

These  two  equations  cannot  be  solved  analytically,  but  approximate  solutions  can  be 
obtained  by  an  iterative  procedure.  The  FCM  uses  iterative  optimization  of  an  objective 
function  based  on  a  weighted  similarity  measure  between  data  points  and  cluster  centers. 


Step  1.  Input  the  number  of  clusters,  c,  the  weighting  exponent,  m,  and  the  error 
tolerance,  e. 

Step  2.  Input  the  data  X  =  {  }. 

Step  2.  Initialize  the  membership  values  U  =  [w/*  ]. 


Step  4.  Calculate  the  new  cluster  centers  by  the  3rd  equation 


Step  5.  Update  the  by  the  2nd  equation. 

Step  6.  Return  to  Step  3,  if  >  .e;  otherwise  output  U... 


X:  [  sxn  ]  n:  #  of  data  elements  -  polygraph  test  sessions. 

U:  [  cxn  ]  s:  #  of  features  -  dimension  of  the  samples  in  each  cluster. 

V:  [  sxc  ]  c:  #  of  clusters 


Fig.2:  The  iterative  FCM‘o  procedure 


‘°See  Fig.3  ,  the  flow  chart  of  the  FCM  code  implemented  in  this  project. 
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Fig.3:  Flow  chart  of  the  FCM  code  implemented  in  this  project 


2.2. 3,2.  What  the  influential  parameters  practically  mean  or  represent, 
and  how  to  interpret  the  clusterins  algorithm  itself: 

The  weighting  exponent  m  represents  the  "fuzziness"  level.  It  controls  the  extent  of 
membership  sharing  among  the  fuzzy  clusters.  Recall  the  example  of  the  two  clusters, 
"desks"  and  "chairs"  in  chapters.  1;  In  a  hard  c-means  clustering  environment  (m  1)  each 
object  can  either  belong  to  "chairs"  or  "desks",  i.e.  its  membership  value  is  either  one  or 
zero  for  each  cluster.  Now,  the  higher  m  is,  the  fuzzier  the  results  will  be.  Thus,  a  desk  - 
.  which  can  also  be  used  as  a  chair-  may  get  a  membership  value  higher  than  zero  for 
belongingness  to  the  chairs  cluster.  In  this  sense,  m  controls  the  membership  values  as 
following 

lim  U:i,  =  — , 

m-^00  C 
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The  control  parameter  epsilon  represents  the  interrupt  criterion.  It  influences  the  number 
of  iterations  and  therefore  the  accuracy  of  the  algorithm  which  is  the  search  for  c  minima. 
By  making  epsilon  smaller  we  get  more  accurate  clustering  results,  but  also  more 
computing  time,  which  is  not  important  in  this  specific  case. 

The  algorithm  primarily  gives  us  after  each  iteration  new  cluster  centers  V^  and  new 
membership  values  U .  It  then  calculates  the  spatial  distances  between  each  data  element 
and  the  found  cluster  centers  then  checks  the  interrupt  criterion.  If  these  distances  are 
small  enough,  the  algorithm  will  eventually  give  us  the  best  membership  values  and  the 
appropriate  cluster  centers.  At  this  point,  the  search  for  an  internal  structure  within  the 
polygraph  data  -the  original  intention  of  every  clustering  process-  will  be  finished. 

FCM  algorithm  belongs  to  the  so-called  partitional  clustering  algorithms  which  generate  a 
fuzzy  c-patition  matrix  in  a  feature  space.  In  this  project  I  set  the  number  of  clusters  c,  as  a 
known  parameter,  equal  to  two.  It  can  otherwise  be  a  part  of  the  clustering  optimization 
itself  This  decision  was  made  after  running  some  initial  tests  with  c  =  3  as  well,  which 
represents  "deceptive",  "truthful"  and  "ambiguous"  clusters. 


Fig.4:  Fuzzy  C-means  algorithm  applied  on  polygraph  data 
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2.2.4.  Why  LMS  fuzzy  adaptive  filter? 


Filters  are  information  processors.  In  practice,  information”  usually  exists  in  two  different 
modes: 


•  Numerical  data  associated  with  the  problem, 

•  linguistic  descriptions  of  human  experts 
(often  in  the  form  of  fuzzy  IF-THEN  rules) 

Conventional  filters  can  only  process  numerical  data,  whereas  expert  systems  can  only 
make  use  of  linguistic  information,  i.e.  a  successful  pattern  recognition  system  in 
conventional  form  can  only  be  guaranteed  where  either  linguistic  rules  or  numerical  data 
do  not  play  a  critical  role.  Recall  the  fact  that  even  in  those  cases  we  decide  for  a 
numerical  method,  we  use  linguistic  information,  consciously  or  unconsciously,  in  the 
choice  among  different  filters,  the  evaluation  of  filter  performance,  the  choice  of  the  filter 
orders,  the  interpretation  of  filtering  results,  and  so  on. 

The  LMS”  fuzzy  adaptive  filter  is  a  new  kind  of  nonlinear  adaptive  filter  which  makes  use 
of  both  linguistic  and  numerical  information  concerning  the  physical  characteristics  of  the 
polygraph  data  in  their  natural  form.  This  filter  is  constructed  from  a  set  of  changeable 
fiizzy  IF-THEN  rules,  i.e.  we  have  the  choice  of  setting  the  rules  according  to  our 
experiences  and  incorporating  them  directly  into  the  filter,  or  initializing  the  rules 
arbitrarily;  similar  to  the  polynomial,  neural  nets,  or  radial  basis  function  adaptive  filters. 

2.2.5.  LMS  fuzzy  adaptive  filter  and  its  interpretation; 

2.2.5. 1.  Filter  code  -  An  adaptive  procedure 

As  stated  before,  this  filter  is  constructed  from  a  set  of  changeable  fiizzy  IF-THEN  rules 
by  matching  input-output  pairs  through  an  adaptation  procedure.  The  adaptive  algorithm 
updates  the  parameters  of  the  membership  functions  which  characterize  the  fuzzy  concepts 
in  the  IF-THEN  rules  by  minimizing  a  criterion  function. 

Consider  a  real-valued  vector  sequence  [x(*)]  and  a  real  valued  scalar  [c/(A:)].  The  adaptive 
filter  /t :  C/  i?  is  to  determine,  such  that  L  =  E[{d{k)-fk ]  is  minimized. 

”  About  the  pattern  of  the  subject  to  be  studied. 

^^LMS  =  Least  Mean  squares 
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With  ^  =  1,2,3,...  and  xW  et/^[C,-,C;]x[Q,C;]x...x[C;,C;]c7?".  U  and  R  are 
the  input  and  output  spaces  of  the  filter,  respectively. 

The  following  steps  describe  the  LMS  fuzzy  adaptive  filter^^  used  in  this  project; 

Step  1:  M  fuzzy  sets  F,'  are  to  be  defined  in  each  interval  IC7,Q]  of  U  with  the 
following  Gaussian  membership  functions 


//^(x,)  =  exp 


cr, 


where  /=  1,2,...,M,  /  =  1,2,. ..,w,  x,.  e[C;,C;i  and  xj  and  a/  are  free  parameters  which 
will  be  updated  in  the  LMS  adaptation  procedure  of  Step  4. 


Step  2;  A  set  of  M  fuzzy  IF-THEN  rules  is  to  be  constructed  in  the  following  form: 

R':  if  Xj  is  Fi  and  ...  x„  is  F^,  THEN  disG‘, 

R^:  IFx,isF,^and...x„isF„^,THEtidisG^. 

where  x  =  (x^,...,x^)  eU,  d  sR,  F;''s  are  defined  in  Step  1,  and  G''s  are  fuzzy  sets 
defined  in  R.  The  (parameters  of)  membership  functions  and  in  these  rules  will 

change  during  the  LMS  adaptation  procedure  of  step  4.  Therefore,  the  rules  constructed 
in  this  step  are  initial  rules  of  the  fuzzy  adaptive  filter. 


Step  3:  The  filter  /*:  U-^R  is  constructed  based  on  the  M  rules  of  the  Step  2  as 
follows: 

M  C  n 


M  f  n 


/=l  V  1=1 


where  u  ,'s  are  the  Gaussian  membership  functions  of  Step  1,  and  ei?  is  any  point  at 
which  u.  achieves  its  maximum  value. 

'  G 

i^This  algorithm  is  suggested  in  [Wangl993]  and  [Wangl9941. 
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Because  we  chose  the  membership  functions  to  be  Gaussian  functions  which  are  nonzero 
for  any  x.  e[C^“,C^],  the  denominator  of  the  last  equation  is  nonz^  for  any  xeU. 
Therefore,  the  filter  is  well  defined,  and  because  the  0'  as  well  as  x'  and  a/  are  free 
parameters,  this  filter  is  nonlinear  in  the  parameters. 


Step  4:  The  following  LMS  algorithm  [Widro^985]  is  used  to  update  the  filter 
parameters  0\  x'  and  a/.  With  the  initial  0'{o),  x/(0)  and  a/(0)  values  determined  in 

Step  2,  the  adaptive  procedure  is  as  following: 


(l{k) 


$'(k)  =«'(*-!)  +  a[d(t)  - 

.  a  (t-D— — ^ 


b{k-\) 


cr/(A:)=o;'(^-l)  +  a[4^)-/*]— 


(x,.(^)-x'(/:-l)) 


M 


where  a'ik - 1)  =  bik-l)  =  h\k - 1),  A  = 

and  a  is  a  small  positive  step-size.  These  equations  are  obtained  by  taking  the  gradient  of 
L  ignoring  the  expectation  E  (see  chapter  2.2.5. 1). 


2. 2. 5. 2.  Influential  parameters  -  meanings  &  interpretations: 

The  LMS  algorithm  is  a  gradient  algorithm,  i.e.  a  good  choice  of  initial  parameters  0' ,  x\ 
and  a/  is  very  important  to  its  convergence  concerning  accuracy  and  time.  Since  the  error 
measure  of  this  "back-propagation"  algorithm  is  an  extremely  complicated  function  of  all 
the  parameters  0\  x’  and  <t/,  it  can  have  numerous  local  minima.  Depending  on  the 
initial  parameter  estimates,  this  algorithm  always  leads  to  the  nearest  minimum,  i.e.  it  can 
become  stuck  in  a  local  minimum  of  the  error  measure. 

Recall  that  this  filter  is  constructed  based  on  linguistic  rules  from  our  previous  experiences 
and  some  arbitrary  rules.  Both  sets  of  rules  are  updated  during  the  LMS  adaptation 
procedure  of  Step  4  by  changing  the  parameters  in  the  direction  of  minimizing  L . 
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In  other  words,  the  adaptation  procedure  can  be  directed  to  the  local  minimum  we  want 
(i.e.  accuracy  factor)  and  can  converge  quickly  (i.e.  time  factor). 

if  these  rules  provide  good  instructions  for  how  the  filter  should  perform,  that  is,  good 
description  of  the  input-output  pairs  [x(A:);i/(A:)]. 

The  updating  parameters  ^'[Mxl],  x'[MxN]  anda/[MxN]  represent  output  means,  input 
means  and  the  input  width  of  the  Gaussian  distributed  data,  respectively.  The  scalar  output 
d  is  basically  the  label*^  of  the  test  data[lxN]  in  numerical  form,  and  <j/  describes  how  far 
the  data  from  the  output  mean  can  be  and  still  be  assigned  to  it  in  an  appropriate  fuzzy 
form.  M  represents  the  number  of  the  rules  and  N  the  number  of  the  features,  i.e.  the 
dimension  of  the  data.  The  parameter  a  is  the  "learning  factor"  or  the  step-size  of  training. 
It  represents  how  fast  and  how  smooth  the  training  process  proceeds. 


Fig.5:  The  LMS  fuzzy  adaptive  filter  used  in  this  project 


•^"deceptive"  or  "non-decptive". 
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3.1.  Part  I  -  FCM 


§3.  APPROACH 


3.1,2.  Initial  stage  (conditions  and  methods); 

A  primary  component  of  every  pattern  recognition  problem  is  feature  extraction.  And  this 
is  actually  one  of  the  most  important  and  influential  tasks  for  any  successful  approach. 

In  previous  researches  [Layeghi 1993,1]  [Jacobsl993]  [Dastmalchil993],  students  have 
already  investigated  a  set  of  669  features  for  each  polygraph  test  session.  They  used  these 
features  to  train,  optimize  and  eventually  classify  the  data  by  a  fuzzy  K-Nearest  Neighbor 
algorithm  (KNN). 

In  this  project,  I  have  used  these  same  features  in  their  original  form.  I  have  also  selected 
their  best  features  and  feature  combinations  for  initial  tests  of  my  algorithm  and  for 
comparison  between  fuzzy-CM,  fuzzy  LMS  adaptive  filter  and  the  fuzzy  KNN  approach. 
At  this  point,  the  question  of  consistency  and  transferability  of  the  features  -  independent 
of  the  algorithm  -  became  more  significant.  It  turned  out  to  be  one  part  of  this  research*^ 


session  #1  session  #100 


Fig.6;  An  example  for  a  set  of  polygraph  data  as  a  matrix 
and  its  features  used  in  this  study 

As  mentioned  earlier,  each  feature  (total  number=960)  is  extracted  for  all  polygraph  test 
questions,  that  is  for  relevant,  irrelevant  and  control  questions.  It  was,  however,  decided 

*^See  also  chapter  4.I.2.3. 
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not  to  use  irrelevant  questions  in  this  study,  because  in  a  Controlled  Question  Polygraph 
Test  comparison  between  the  responses  to  relevant  and  control  questions  is  the  actual  and 
most  important  factor. 


Fig.7;  The  original  feature  combinations 


The  Total  number  of  the  features  for  every  test  session  at  this  stage  is  669.  Each  set 
contains  the  same  non-deceptive  files  but  different  deceptive  ones.  For  more  specific 
details  about  how  the  feature  extraction  was  processed,  and  about  combination  methods 
which  narrowed  the  total  number  from  960  to  669,  see  the  references  mentioned  above. 


After  implementation  and  initial  tests  of  the  FCM-code,  I  began  with  the  one-dimensional 
clustering  (using  one  feature  for  all  sessions).  I  used  three  sets  (polydat_l,  polydat_2, 
polydat_3)  of  such  structured  data  as  shown  in  Fig.42  containing  100  data  elements,  i.e. 
50  truthful  and  50  deceptive  files.  With  these  data,  we  ran  669  one-dimensional  clustering 
searches  containing  100  different  one-dimensional  data  points  at  each  time.  As  a  result,  we 
attained  669  times  2  clusters  for  each  polydatj. 
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After  running  these  tests  and  evaluating  them,  I  decided  to  select  four  sets  of  "best"  one¬ 
dimensional  features  out  of  each  polydat_i  in  preparation  for  the  multi-dimensional 
clustering  search.  This  decision  was  necessary  to  narrow  the  number  of  features,  since  it  is 
impractical  to  find  the  best  combination  (concerning  the  quantity  and  the  quality)'®  out  of 
this  immense  number  of  features  by  an  exhaustive  way  of  searching. 


For  example,  chosing  only  4  or  less  feature-tuples  from  a  set  of  669  by  trying  all  the 
possible  different  combinations  needs  the  following  number  of  computations: 


4 

I 

i  =  l 


y  669! 
i5li!(669-i)! 


The  other  challenge  while  finding  good  feature  combinations  is  the  problem  of  single 
features  which  yield  poor  results  by  one-dimensional  clustering,  but  when  used  in 
combination  with  other  features  yield  very  good*’  results. 


To  narrow  the  amount  of  possible  features,  I  decided  to  select  the  following  four  sets  of 
single  features  with  different  performances. 


Fig.8;  Selected  features  by  using  one-dimensional  FCM 


The  threshold  of  60%  was  chosen,  because  any  other  value  below  or  above  that  limit 
would  again  give  us  either  too  many  or  not  enough  features.  Furthermore,  any  other  value 

'®That  means:  How  many  features  and  which  ones  should  be  taken  in  a  combination, 

'^"Good"  or  "poor"  in  sense  of  the  definition  in  chapter  1.1.2. 
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closer  to  the  limit  50%  for  both  deceptive  and  non-deceptive  files  would  be  only  a  random 
clustering  process.  Yet,  this  decision  was  not  enough.  We  would  have  lost  some  good 
features  which  provide  correct  detections  -  better  than  80%  -  for  at  least  one  of  the  files. 
The  fourth  group  was  chosen  to  enable  us  to  consider  some  extreme  cases. 

As  an  additional  set  of  one-dimensional  features,  I  chose  those  with  good  results  in  multi¬ 
dimensional  tests**  for  one  of  the  polydat_i's,  and  used  them  also  for  the  other  two 
polydatj's,  even  though  they  didn't  belong  to  one  of  the  four  feature  sets  mentioned 
above.  This  set  was  important  to  fulfill  the  constraint  of  consistency  and  transferability  for 
any  chosen  polygraph  data*’. 


**See  chapter  3.I.3.2. 

*^See  the  comparison  in  chapter  4. 1.2.3. 
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Fig.9:  An  example  for  one-dimensional  clustering 
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Fig.lO:  An  exmple  for  the  first  group  of  selected  features 
(  representing  group  #1  at  page ) 


1.3.2.  Multi-dimensional  search  for  the  best  feature  combination: 


3.1.3. 2. 1.  Overview: 


Having  obtained  these  four  sets  of  features,  a  multi-dimensional  searching  process  through 
all  of  them  was  initiated  to  find  the  best  feature  combinations  (concerning  the  quantity  and 
the  quality^®). 

Even  though  the  number  of  the  features^i  has  already  been  narrowed,  it  is  still  impractical 
to  do  an  exhaustive  search,  since  the  total  number  of  the  features  contained  in  these  four 
sets  is  about  100  for  each  polydatj.  In  other  words,  the  following  number  of 
computations  is  still  needed  for  calculation  of  all  4  or  less  possible  feature-tuples. 


4  A00"\ 

I  . 

i  =  lV  ‘ 


iJli!(100-i)! 


»4.0<10®  . 


At  this  stage,  I  decided  to  investigate  3  different  search  methods  to  bypass  the  exhaustive 
way.  They  are 

1 .  random  search  without  duplication  of  any  feature  within  a  tuple, 

2.  pseudo-exhaustive  search  with  the  option  of  duplication  and  finally 

3.  genetic  search  with  "uncontrollable"  possibility  of  duplications. 

In  previous  research  projects  [Layeghi  1993,1]  [Dastmalchil993]  [Jacobsl993],  it  was 
decided  to  narrow  the  feature  numbers  from  669  to  30  "best"  ones  and  then  an  exhaustive 
search  was  run  for  up  to  four-  or  five-tuple  combinations.  In  other  words,  their  strategy 
was  completely  different  than  the  aforementioned  three  strategies. 

As  mentioned  before  a  "poor"  or  an  average  single  feature  by  one-dimensional  clustering 
might  give  us  in  combination  with  other  features  very  good  or  even  better  results  by  a 
multi-dimensional  clustering  than  any  of  them  individually. 

This  fact  was  totally  neglected  by  the  feature  selection  methods  used  in  the  previous 
researches'^  [Laueghi 1993,1]  [Dastmalchil993]. 

20That  means:  How  many  features  and  which  ones  should  be  taken  in  a  combination. 

2*See  chapter  3.1.3. 1. 

^^See  chapter  4.3.  comparison  for  more  details  about  differences  between  this  and  previous  works. 
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Applying  these  three  new  strategies,  I  was  able  to  consider  more  possible  features  for  a 
multi-dimensional  clustering  than  in  previous  works,  without  using  the  impractical 
exhaustive  method. 


polygraph  data 


session  #1  session  #100 


Fig.  11:  General  search  to  find  the  best  feature  combination 


3. 1.3. 2. 2.  Random  search  method: 


Applying  this  method,  an  average  of  14  to  20  different  features  out  of  the  aforementioned 
four  sets  were  taken,  and  then  the  FCM  algorithm  including  the  evaluation  program  for 
randomly  chosen  4-tuples  were  run.  After  about  1000  combinations  were  constructed,  I 
then  picked  out  the  best  features  and  their  combinations,  and  replaced  the  poor  ones  with 
new  features.  This  same  procedure  was  repeated  until  good^^  combinations  were  found. 

23"Good"  in  sense  of  the  definition  in  chapter  1.1.2. 
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Every  time  the  results  were  out  of  balance  -  i.e.  highly  better  detection  either  for  deceptive 
or  non-deceptive  files  by  the  cost  of  the  other  one  -  I  appropriately  took  additional 
features  from  those  four  sets  to  eliminate  the  difference  by  improving  the  results  of  the 
worse  file  -  and  as  much  as  possible  -  by  maintaining  the  results  of  the  better  file. 

After  running  this  kind  of  tests  several  times,  we  were  able  to  estimate  which  features  are 
the  good  ones  to  combine  together. 


3.7.3.23.  Pseudo-exhaustive  search  method: 

Having  some  idea^'*  which  features  are  good  in  a  combination  with  others^^,  I  built  every 
possible  four-  to  six-tuples  out  of  those  features  and  evaluated  them.  This  method  was 
very  important  to  make  sure  that  we  did  not  lose  any  good  combinations  which  might 
have  been  neglected  by  the  random  search. 

I  called  this  method  "-exhaustive,  because  each  time  it  considers  only  a  small  part 

of  the  available  features;  but  "exhaustive”,  because  it  takes  all  the  possible  combinations 
within  this  part.  Except  for  this  major  difference,  all  the  other  steps  of  this  method  are 
exactly  the  same  as  the  random  search. 


3. 1.3. 2.4.  Genetic  search  method: 

This  algorithm  is  basically  a  compromise  between  the  pseudo-exhaustive  and  the  random 
search  method,  plus  a  weighting  system  which  supports  those  features  with  good  results. 

Initial  populations  of  200  to  300  chromosomes^^  are  randomly  created.  Each  chromosome 
is  a  combination  of  N  features,  where  N  stays  constant  for  each  population  during  the 
outgrowth.  Each  single  feature  is  selected  from  a  gene  pool  for  the  particular  population 
that  the  individual  belongs  to.  Each  gene  pool  consists  of  twenty  to  forty  features  that  we 
have  chosen^’. 

2'’By  using  the  results  of  the  random  search  method  and  also  the  5th  group  mentioned  at  page  3. 1.3.1. 
25Remember  the  fact  that  some  "poor"  single  features  might  give  us  in  combination  with  others  very 
good  results 

^^Individuals  or  feature-tuples. 

^^Directed  by  our  experience  from  using  the  random  and  the  pseudo-exhaustive  methods. 
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In  this  project  three  processes  operate  on  the  evolution^^  of  each  population; 

•  reproduction 

•  crossover 

•  mutation. 

These  three  processes  determine  how  each  new  generation  will  be  created  based  on  the 
old  one.  Before  genetic  reproduction,  the  fiizzy-c-means  algorithm  evaluates  the 
percentage  of  correct  deceptive  and  non-deceptive  detections  for  each  chromosome.  The 
average  of  them  is  the  fitness  value  of  that  chromosome.  During  the  genetic  reproduction, 
the  chromosomes  of  the  new  generation  are  copied  from  the  chromosomes  of  the  old 
generation  in  a  probabilistic  sense.  The  probability  that  a  particular  chromosome  will  be 
copied  is  the  ratio  of  that  chromosome's  fitness  value  against  the  total  fitness  values  of  the 
entire  population  of  the  old  generation. 

After  selection,  genetic  crossover  randomly  chooses  pairs  of  chromosomes  as  parents, 
splices  them,  and  recombines  them  -  by  randomly  mixing  some  of  the  parents  genes  -  into 
pairs  of  offsprings.  Finally,  genetic  mutation  randomly  substitutes  a  new  gene  within  a 
randomly  chosen  chromosome.  The  extent  to  which  crossover  and  mutation  occur  can  be 
verified  by  appropriate  initialization. 


Fig.  12:  An  example  for  the  genetic  outgrowth  with 
4  genes  (=features)  in  each  chromosome  (=individual) 


2*See  chapter  4. 1 .2.2  for  particular  results  of  this  method. 
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i?.  1.3.3.  General  process  -  Ovtimizatmi  by  chan^n?  parameters: 


Simultaneously  to  the  search  for  the  best  features  and  their  combinations,  we  were 
optimizing  the  system  by  changing  and  adjusting  the  parameters.  Recall,  the  whole  idea  of 
this  pattern  recognition  was  to  cluster  the  unlabeled  data  into  two  clusters  which  represent 
the  deceptive  and  the  truthful  group^^. 

Knowing  the  information  of  which  files  were  deceptive  or  truthflil^o,  were  able  to 
change  the  parameters  in  the  way  that  the  output  could  continuously  come  closer  to  the 
real  cluster  structure.  This  process  is  depicted  in  the  following  figure.  The  "fuzzy  c-means 
algorithm"  block  not  only  represents  the  pure  FCM  algorithm  shown  in  Fig.3,  but  also  the 
general  search  for  good  features  shown  in  Fig.  11  which  ran  simultaneously  with  the 
optimization  process. 


Fig.  13:  Optimization  of  the  clustering  environment 
-  General  process  - 


As  an  example,  I  will  briefly  discuss  how  the  parameter  m  was  chosen  and  eventually 
modified:  The  weighting  exponent  m  plays  a  significant  role  in  this  system.  Since  the 
control  parameter  m  itself  does  not  belong  to  the  optimizing  values  within  the  iterative 
process  of  FCM  algorithm,  one  must  choose  m  before  implementing  the  algorithm,  and 

^^Scc  chsptcr  3  1.2. 

know  this  information  beforehand  for  sure,  because  the  subjects  have  confessed  their  case  or  the 
actual  offender  was  found. 
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optimize  it  manually.  There  are  several  research  papers  written  as  an  attempt  to  find  the 
optimal  m  for  different  clustering  problems. 

The  effect  of  m  was  discussed  in  [Bezdekl981].  Although  Bezdek  proposed  heuristic 
guidelines  for  m,  no  theoretical  basis  for  an  optimal  choice  for  m  has  been  reported.  The 
only  known  paper  in  this  matter  [Choel9923  proposed  a  method  for  determining  m  based 
on  the  concept  of  fiizzy  decision  theory  initiated  by  [Zadehl970]. 

But  since  the  definition  of  "good"  clusters  in  [Choel992]  did  not  exactly  match  to  our 
clustering  environment,  I  chose  the  "trail  and  error"  strategy  to  find  the  optimal  m  by 
systematically  increasing  it.  Fortunately,  there  is  a  logical  limit^^  for  this  increasing  process 
in  our  case,  even  though  m  can  mathematically  be  any  value  from  [  2,  oo ). 


", "  represents  the  mmebership  values  form-2 
"  represents  the  membership  values  for  m-5 


Fig.  14:  An  example  for  the  influence  of ’m’ 


3>See  chapter  2.2.3.2.  for  the  meaning  of  m. 
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For  more  details  on  this  matter  see  the  chapter  4.1.1.  In  Fig.  14,  you  see  an  example  for 
how  the  weighting  exponent  m  influences  the  membership  values  for  one  of  the  features 
from  polydat_3  in  one-dimensional  mode. 

3. 1.3. 4.  Evaluation  stratesv: 


Due  to  the  small  number  of  non-deceptive  cases  available,  each  session  for  a  subject  was 
used  as  a  separate  and  individual  case.  But  in  average,  each  group  of  three  sessions  belong 
to  one  person  concerning  the  same  crime,  meaning  the  results  of  these  sessions  are  not 
independent  of  each  other.  Using  this  additional  information,  the  clustering  system  can 
come  closer  to  the  actual  structure  of  the  data,  i.e.  we  can  get  a  better  performance. 


polygraph  examination  for  a  deceptive  subject 


session 


session  #2 


session  #3 


d] 

© 


© 


FCM 


non-deceptive 


deceptive 


deceptive 


:  Control  question 
(r^  :  Relevant  question 


deceptive 


Fig.  15:  An  example  for  the  final  evaluation  using  the 
dependency  of  the  sessions 
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After  clustering  and  evaluating^^  each  session  separately,  some  cases  with  different 
responses  to  the  algorithm  were  found,  although  they  belonged  to  one  person.  In 
circumstances  like  this,  we  combined  the  individual  results  within  each  group  in  a  way  that 
the  majority  response  was  assigned  to  the  whole  group  (see  Fig.  15). 

In  those  cases  that  each  polygraph  examination  contains  2  or  4  test  sessions  where  there  is 
no  majority  response  to  build,  I  decided  to  take  only  those  membership  values  further  to 
the  threshold  0.5.  For  example,  by  the  feature  combination  [30,  30,  39,  235,  363,  450] 
used  to  cluster  polydat_l,  we  obtained  for  one  of  the  examination  with  four  sessions  the 
following  membership  values:  0.4164,  0.5519,  0.5377,  0.4780.  After  defuzzification  we 
got  0,  1,  1,  0  where  no  majority  class  can  be  build.  However,  the  second  and  the  third 
membership  values  are  closer  to  the  threshold  than  the  other  two  ones.  With  the 
aforementioned  strategy,  this  examination  is  labeled  with  0. 

Recall  that  each  polygraph  examination  has  a  set  of  control  and  relevant  questions  which 
is  repeated  an  average  of  three  times.  The  only  difference  between  each  session  is  the 
order  in  which  the  questions  are  asked. 


32The  general  evaluation  process  is  contructed  as  following: 

After  each  clustering  procedure  (  one-  or  multi-dimensional )  a  two-row  vector  of  membership  values  is 
given  which  represent  the  two  deceptive  and  non-deceptive  clusters.  The  evaluation  process  takes  the 
membership  values  of  one  these  clusters  and  counts  the  values  below  and  above  the  threshold  0.5.  Thus,  as 
a  result  we  get  the  absolute  number  of  wrong  and  right  detections. 
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3.2.  Part  II  -  LMS  fuzzv  adaptive  filter 


S.2.1.  Feature  selection  bv  visual  inspection: 

One  advantage  of  a  fuz2y  logic  system  is  its  use  of  common  sense  human  reasoning  as 
inference  rules.  The  fuzzy  LMS  algorithm  we  used  extends  this  advantage  by  further 
optimiang  such  inference  rules  to  "fit"  a  given  set  of  data.  To  fully  utilize  the  advantages 
of  this  fuzzy  LMS  algorithm,  we  had  to  face  two  issues:  coming  up  with  the  proper 
intuitive  rules  for  initialization  and  a  set  of  data  that  reflects  real-world  examples  for 
training. 

As  mentioned  before,  for  practical  reasons,  the  polygraph  recognizer  can  use  only  a  subset 
of  the  given  669  features,  and  we  would  have  to  choose  the  effective  ones.  Furthermore, 
the  fiizzy  logic  system  needed  reasoning  rules,  operating  on  those  features  we  selected,  to 
analyze  the  data.  We  believed  that  we  could  visually  inspect  graphical  plots  of  the  feature 
data  to  learn  about  the  feature  information.  Since  fuzzy  logic  corresponds  closely  with 
human  reasoning,  we  would  then,  based  on  the  knowledge  obtained  from  our  visual 
inspection,  select  features  that  help  differentiate  deceptive  and  non-deceptive  subjects  and 
codify  the  patterns  we  would  find  into  reasoning  rules. 


For  the  visual  inspection,  a  scatter  plot  was  made  of  the  data  in  polydat_3  of  each  single 
feature.  We  looked  at  each  plot  individually.  In  any  given  plot,  if  the  deceptive  and  non- 
deceptive  subjects  showed  distinctive  clusters,  then  the  feature  was  considered  good.  If 
the  elements  of  these  two  classes  seemed  to  be  randomly  located,  then  the  feature  was 
considered  bad.  After  viewing  all  669  plots,  we  subjectively  determined  the  following 
features33  to  be  very  good:  9,  1 1,  29,  164,  399, 449,  450,  451, 452,  and  454;  with  451  and 
452  to  be  the  best. 

Initially  the  fiizzy  adaptive  filter  was  to  be  designed  based  on  two  features,  with  more 
features  to  be  added  in  the  future  as  the  project  progresses.  We  limited  the  feature  couple 
to  be  composed  of  good  features  from  the  above  list.  Visual  inspection  was  made  of  the 
scatter  plots  of  the  data  in  polydat_3  of  various  such  feature  combinations  to  determine 
the  effective  ones.  While  selecting  feature  couples,  we  again  searched  for  combinations 
that  show  distinctive  clusters  for  deceptive  and  non-deceptive  subjects.  The  features 


33See  Fig.41  for  the  meaning  of  these  numbers. 
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within  a  combination  should  also  be  uncorrelated  with  each  other.  A  plot  of  the  feature 
449  and  450  combination  shows  that  they  are  a  bad  couple  because  they  seem  to  be 
linearly  correlated^'*,  as  the  data  points  fall  closely  along  a  straight  line. 


Fig.l6:  Scatter  plots  of  two  linearly  correlated  features 


Visual  inspection  of  feature  couples  consumed  much  more  time  than  visual  inspection  of 
individual  features,  as  the  clusters  took  on  more  complicated  shapes.  Furthermore,  in  the 
fuzzy  LMS  algorithm  each  inference  rule  exerts  influence  centered  in  an  elliptical  contour 
where  the  major  and  minor  axes  are  parallel  with  the  axes  of  the  feature  plot.  Clusters  with 
a  complicated  shape  must  be  built  from  those  elliptical  regions  (see  next  figure).  Therefore 
we  had  the  additional  task  of  finding  clusters  in  the  feature  plots  that  could  be  easily 
approximated  with  few  ellipses,  to  reduce  system  complexity. 

Due  to  the  lack  of  time,  we  did  not  examine  the  plots  of  all  forty-five  possible 
combinations  of  the  ten  very  good  features  listed  above.  We  only  examined  a  random  few. 
Based  on  the  ones  we  did  examine,  we  settled  on  the  combination  of  features  451  and  452 
because: 

^‘’Correlation  between  two  features  means  that  information  in  one  is  similar  to  the  information  in  the 
other  one,  and  using  them  together  only  introduces  redundancy  and  hardly  improves  the  system. 
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•  they  were  the  best  -  visually  recognizable  -  features  individually, 

•  they  seemed  uncorrelated  with  each  other  and 

•  we  roughly  found  four  elliptical  clusters  from  the  plot. 


non-deceptive  flies 
'o*:  deceptive  files 
Polydat’3 


Fig.  17:  The  four  elliptical  clusters  used  for  setting  the  linguistic  rules 
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3.2.2.  Setting  linguistic  rules: 


We  initialized  the  fuzzy  system  such  that  it  would  exploit  the  knowledge  we  had  just 
obtained  about  the  clusters  for  features  451  and  452.  There  were  two  inputs,  one  for  each 
feature,  and  four  rules,  one  for  each  cluster.  We  had  to  represent  those  visual  clusters  we 
found  with  inference  rules.  The  linguistic  rules  are  shown  in  the  following  figure. 


1.  IF  fl  is  about  -1  (±0.5)  and  f2  is  about  -0.5  (+0.8), 

THEN  decision  is  non-deceptive  =>  output  is  +1. 

2.  IF  fl  is  about  0  (+0.5)  and  f2  is  about  -0.25  (±0.25), 

THEN  decision  is  non-deceptive  =^>  output  is  +1. 

3.  IF  fl  is  about  0  (±0.1)  and  f2  is  about  0  (±0.2), 

THEN  decision  is  deceptive  =>  output  is  -1. 

4.  IF  fl  is  about  1  (±0.6)  and  £2  is  about  0.3  (±0.5), 

THEN  decision  is  deceptive  =>  output  is  -1. 

fl:  measurement  of  feature  #  451 
_ f2:  measurement  of  feature  #  452 


Fig.  18:  Initial  linguistic  rules  for  the  fuzzy  adaptive  filter 
based  on  the  clusters  in  Fig.  17 

The  linguistic  rules  above  were  then  translated  to  fiizzy  membership  functions  as  outlined 
in  [Wang 1994].  The  xi's  were  the  centers  of  the  clusters;  the  sigmas  were  the  widths  of  the 
clusters  (±xxx  in  the  above  rules);  and  the  thetas  were  either  +1  or  -1  for  non-deception 
and  deception,  respectively. 

The  output  of  the  fuzzy  reasoning  based  on  the  above  four  rules  would  not  be  exactly  +1 
or  -1.  It  would  be  within  the  range  limited^^  by  +1  and  -1.  For  our  project,  we  decided 
that  a  positive  output  denotes  non-deception  and  a  negative  output  denotes  deception.  In 
other  words,  the  decision  threshold  was  at  zero. 

After  training  the  output  may  go  beyond  that  range. 
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For  future  investigations  one  may  experiment  with  a  different  threshold^^. 

The  choice  of  plus  and  minus  one  for  non-deception  and  deception  is  based  on  the 
following  argument:  The  learning  technique  uses  the  squared  error,  which  is  the  square  of 
the  difference  between  the  desired  output  and  actual  output.  In  computing  that  squared 
error,  if  the  difference  between  the  desired  output  and  actual  output  is  greater  than  one, 
then  the  squaring  operation  expands  the  error  value  and  therefore  gives  more  significance 
to  such  mistakes.  On  the  other  hand,  if  the  difference  is  less  than  one,  than  the  squaring 
operation  compresses  the  error  value  and  therefore  gives  it  less  significance. 

Given  zero  as  the  threshold  between  deception  and  non-deception  and  assuming  the  actual 
output  would  never  go  beyond  plus  two  or  minus  two,  then  the  choice  of  plus  and  minus 
one  as  desired  outputs  would  mean  that  the  error  calculation  gives  more  significance  to 
misclassifications  and  less  to  correct  classifications;  Here  classification  refers  to  the  crisp, 
defuzzified  classification,  not  the  degree  of  belonging. 

For  example,  the  desired  output  for  non-deceptive  subjects  is  plus  one.  If  the  actual  output 
is  between  zero  and  two,  then  the  crisp  classification  is  non-deception,  which  is  correct. 
The  numerical  difference  between  the  actual  output  and  the  desired  output  is  less  than  one 
in  this  case,  and  the  squaring  operation  would  lessen  the  significance  of  that  error.  On  the 
other  hand,  if  the  actual  output  is  less  than  zero,  then  the  crisp  classification  would  be 
deception,  which  is  wrong.  In  that  case,  the  numerical  difference  between  the  desired 
output  and  the  actual  output  is  greater  than  one  and  more  significance  would  be  given  to 
such  mistakes.  Similar  argument  can  be  apply  for  the  choice  of  minus  one  as  the  desired 
output  for  deceptive  subjects. 


3.2.3.  Training,  testing  and  evaluation  stratesv: 

The  fuzzy  LMS  algorithm  can  be  optimized  to  a  specific  set  of  data.  To  exploit  that  aspect 
of  the  algorithm,  we  also  selected  a  set  of  data  to  train  the  system.  Following  a  procedure 
similar  to  one  used  in  an  earlier  project  with  KNN  classifying  algorithm  [Layeghil993], 
we  had  35  deceptive  subjects  and  35  non-deceptive  subjects  -  from  each  polydatj  -  for 

^^One  may  also  view  the  output  as  a  fuzzy  value  and  map  it  to  a  confidence  value  in  addition  to  just  a 
deception/non-deception  decision.  That  would  differentiate  a  sure  Judgment  from  an  unsure  one  and  may 
be  more  helpfiil  in  practice. 
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training.  However,  with  a  set  of  only  100  subjects  within  each  polydatj,  that  left  a  rather 
small  amount  for  testing  (i.e.  15  deceptive  and  15  non-deceptive  subjects).  Therefore  we 
also  tested  the  algorithm  with  10  deceptive  subjects  and  10  non-deceptive  subjects  for 
training  and  the  rest  (40  deceptive  subjects  and  40  non-deceptive  subjects)  for  testing. 
That  might  be  a  bit  extreme  in  the  other  direction,  but  we  could  interpolate  the  results  and 
also  see  the  sensitivity  of  the  algorithm  to  the  amount  of  training  data. 

We  tested  both  cases  for  all  three  polydatj's,  giving  a  total  of  six  tests.  Each  test  was 
repeated  twenty  times.  The  training  data  were  randomly  chosen  each  time,  and  the  rest  of 
the  available  data  in  each  set  were  used  for  testing.  We  recorded  for  each  test  the  average 
of  those  twenty  trials.  This  repeated  testing  was  done  to  ensure  that  the  results  were  not 
dependent  on  a  particular  choice  of  training  data. 
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3.2.4.  What  to  do  with  the  memorizing  problem? 


Most  learning  algorithms  suffer  the  dilemma  of  overlearning,  or  memorizing.  Usually  the 
problem  occurs  when  the  learning  algorithm  tries  too  hard  at  optimizing  itself  to  a  set  of 
training  data,  sometimes  to  the  point  of  memorizing  them,  such  that  it  does  not  generalize 
to  understand  new  data.  Overlearning  is  exacerbated  when  the  training  data  set  is  not 
completely  representative  of  the  testing  set. 

In  a  pattern  recognition  problem,  while  the  recognition  rate  for  the  training  data  may 
increase  steadily  until  it  reaches  a  certain  plateau,  the  recognition  rate  for  testing  data  may 
only  increase  for  a  while,  after  which  it  may  decrease  until  it  hits  a  plane.  We  observed 
such  phenomenon  in  our  system: 


The  training  data  consist  of  twenty  non-deceptive  subjects  and  twenty  deceptive  subjects  from  polydat_3. 

_ The  testing  data  consist  of all  one  hundred  subjects  from  polydat_L 


Fig.l9:  An  example  for  memorizing  as  the  system  "leams" 

The  point  where  the  recognition  rate  starts  to  decrease  marks  the  beginning  of 
overlearning.  In  practical  applications,  most  adaptive  learning  algorithms  are  trained  only 
to  the  point  before  overlearning  occurs,  when  the  performance  on  the  testing  data  reaches 
its  peak. 
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In  our  testing  we  had  taken  that  approach  and,  for  each  trial,  the  percentage  of  correct 
recognition  was  taken  as  the  maximum  attained  for  the  testing  data  within  forty  epochs^"^. 

We  disregarded  the  recognition  rate  for  the  training  data  because  for  many  systems, 
including  our  own,  a  proper  set-up  could  easily  attain  a  recognition  rate  of  100%.  That  is, 
the  recognition  rate  of  the  trmning  data  bears  little  importance  in  practical  applications. 


^’An  epoch  is  defined  as  one  complete  cycle  through  all  the  training  data. 
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§4.  RESULTS  AND  CONCLUSIONS 


4.1.  Fuzzv-c-means 


4. 1. 1.  Searching  for  the  best  level  of  fuzziness  ( parameter 'm' ): 

One  of  the  major  steps  during  the  one-dimensional  clustering  was  the  searching  process 
for  the  best  value  of  For  this  process,  it  was  necessary  to  run  the  FCM  algorithm  for 
different  m's  and  for  different  data  by  increasing  m  systematically.  This  was  done  for  all 
669  features  and  for  each  polydat_i,  by  every  new  m. 

Recall  that  it  was  decided  to  consider  four  groups  of  features  to  limit  the  feature  pool  for 
multi-dimensional  clustering.  Even  though  the  general  development  -  while  changing  m  - 
was  similar  for  each  polydat_i,  the  individual  reaction  of  these  4  groups  within  each 
polydatj  was  a  little  different.  For  the  final  decision,  we  considered  all  these  variances, 
correct  detection  rates  and  also  the  distributions  of  the  membership  values  for  each  m. 

In  the  following,  I  will  mention  some  of  the  remarkable  observations  we  have  made  during 
this  process  (see  also  the  following  tables  and  figures  representing  the  results  of 
polydat_3): 

As  expected,  the  membership  values  did  approach  the  0.5-leveP^  by  increasing  nj,  i.e. 
the  results  became  fuzzier.  Thus,  we  had  to  limit  the  increasing  process  to  avoid  the 
uncertainty  of  the  results  caused  by  too  much  "fuzziness"  (which  means  that  every  person 
belongs  to  both  clusters  with  almost  the  same  possibility).  However,  we  could  observe  a 
very  interesting  phenomenon.  Even  though  the  membership  values  came  closer  to  0.5,  and 
the  distances  for  different  persons  to  this  level  were  around  10’*  (with  x>  3),  they  were 
still  visually  recognizable  as  deceptive  and  truthful  clusters. 

See  the  following  two  figures  and  also  the  Fig.  14  for  examples.  Notice  that  the  first  50 
sessions  represent  the  non-deceptive  persons  and  the  other  50  the  deceptive  ones. 


3*See  also  chapter  3. 1.3.3.  for  the  discussion  about  finding  the  best  m. 
39see  chapter  2.2. 3.2.  for  more  details. 
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membership  valu< 


Fig.20:  Influence  of  increasing ’m’  for  polydat-3  session  #1 
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In  the  following  two  tables,  the  influence  of  changing  m  (for  polydat_3/group  #1,  as  an 
example)  is  depicted.  As  mentioned  earlier  this  group  represents  those  features  which  give 
us  better  than  60%  right  detection  for  both  deceptive  and  non-deceptive  files  by  one¬ 
dimensional  clustering. 

As  you  see  in  these  examples,  while  increasing  the  parameter  in,  new  "good"  features 
appear.  Some  old  ones  provide  even  better  detection  rates  and  some  get  worse  or  even 
disappear.  This  progress  is  not  unlimited.  As  you  see,  the  development  fi'om  'm=4'  to  'm=5' 
is  smoother  than  between  'm=2'  and  'm=4'  regardless  of  'm=3'  step.  By  continuing  this 
process  above  'm=5',  the  tendency  becomes  rather  negative. 

Those  features  marked  with  (*)  represent  a  better  detection  rate  than  75%  at  least  in  one 
of  the  two  clusters.  Notice  that  these  features  also  change  during  the  increasing  process  of 
m.  By  continuing  this  process  above  'm=5',  also  this  tendency  becomes  rather  negative. 

After  considering  the  other  groups'*®  and  their  development  for  each  polydat_i,  'm=5' 
appeared  to  be  the  best  compromise.  Notice  that  there  is  also  an  outstanding  result  for 
feature  number  452  by  'm=5'  (see  Fig.23).  That  was  the  only  inidividual  feature  ever  by  an 
one-dimensional  clustering  process  with  a  correct  detection  rate  of  90%  for  non-deceptive 
files. 

Another  interesting  aspect  is  that  independent  of  m,  the  conglomeration  areas  where 
"good"  features  appear  are  always  the  same:  For  example  the  half  of  the  "good"  features 
are  among  the  first  hundred,  but  between  200  and  300,  there  is  only  one. 

In  the  next  tables  we  will  use  the  following  abbreviations: 

Feature  number. 

Wrong  detection  within  the  deceptive  cluster  in  percent. 

Wrong  detection  within  the  non-deceptive  cluster  in  percent. 

Features  with  a  better  detection  rate  than  75%  at  least  in  one  of  the  two 
clusters. 

'in=...'  MINUS  Represents  the  difference  in  detection  rates  by  using  different  w's. 


ft#: 

w_dcp: 

w_non: 


'*®SeeFig.8. 
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Fig.22:  Comparison  between  the  results  for  ’m=2’  and  ’m=4’ 
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group  #1  &  in=S 


ft# 

w_dcp 

w_non 

1.0000 

24.0000 

18.0000 

3.0000 

32.0000 

20.0000 

4.0000 

24.0000 

36.0000 

5.0000 

30.0000 

32.0000 

12.0000 

40.0000 

30.0000 

15.0000 

24.0000 

18.0000 

17.0000 

32.0000 

20.0000 

18.0000 

24.0000 

34.0000 

19.0000 

30.0000 

32.0000 

22.0000 

24.0000 

28.0000 

29.0000 

36.0000 

18.0000 

30.0000 

24.0000 

20.0000 

31.0000 

28.0000 

32.0000 

33.0000 

36.0000 

32.0000 

36.0000 

30.0000 

16.0000 
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for  the  abbreviations  see  page  46 


Fig.23:  Comparison  between  the  results  for  ’m=4’  and  'm=5' 
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4. 1.2.  Searching  for  the  best  feature  combination: 


4.1.2.L  Results  of  the  conventional  methods  and  general  observations: 

As  mentioned  in  chapter  3. 1.3. 2.1,  we  decided  for  three  different  strategies  to  find  out  the 
best  feature  combination  that  can  represent  the  two  sought  clusters  within  the  polygraph 
data. 

After  a  short  while  of  a  "trial-and-error"  testing  with  the  multi-dimensional  clustering 
algorithm  and  achieving  some  experience  about  how  well  which  features  are  in  a 
combination  with  others,  I  decided  to  start  a  systematic  searching  process  beginning  with 
four-tuple  combinations.  In  the  followings,  I  will  mention  some  of  the  general 
observations'**  we  made; 

•  not  always  all  of  the  good  one-dimensional  features  were  represented 
within  the  best  feature  combinations, 

•  good  one-dimensional  features  with  the  same  detection  rate  did  not 
provide  the  same  results  within  coequal  combinations, 

•  some  poor  or  average  individual  features  turned  out  to  be  the  best 
features  in  a  combination  with  others, 

•  by  repeating  some  features  in  a  combination,  we  obtained  a  few  new 
good  combinations, 

•  good  feature  combinations  always  gave  us  better  results  than  any  of  the 
features  individually  and 

•  the  quality  of  the  feature  tuple  does  not  depend  on  the  order  of  the 
features  within  the  tuple. 

In  the  following  tables,  you  see  an  example  for  using  the  random  search  method  for 
polydat_3  ('m=2'  and  'm=5')  for  four-tuple  combinations. 


41  See  also  chapter  4.3. 
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feature  number 

=  {1,4,3,9,22 

29, 30, 36, 37, 39, 450, 457, 458, 460] 

condition:  if  (  (  (nn>-80)  &  (ww>-80)  )  1  (  (nn>’ 

=S(5j  1  (ww> 

=86))  ) 

table  1 

feature  Dositions 

right  detection 

feature  nositions 

rieht  detection 
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dcp-ok 
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14 
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feature  number  =  {1, 4, 3, 8, 

9,18,22,29, 

30,36, 37,  39,81»457} 

condition:  if  (  (  (nn>-80)  &  (ww>-80)  ) 

1  ((nn> 

&  (ww>^7^)  ) 
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Fig.  24.1:  Feature  combinations  by  'random  search'  -  polydat_3,  'm=2' 
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feature  number  =  {1, 4, 3, 2x  9,  22, 30, 36, 37, 81, 308, 457, 4^} 

condition:  if(  ((nn>=80)  &  (ww>=80) )  \  ((nn>=86)  &  (ww>=78) )  ) _ 

table  3 
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non-ok  dcp-ok 
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Fig.  24.1;  Continued 
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1  feature  number  =  {1, 4, 3, 8, 9, 21, 22, 30, 35, 36, 81, 198, 457, 459} 

condition:  if  (  (  (nn>-80)&(ww>-80)  ) 

1  ( (nn>- 

11 

A 

oo 

II 

table  3 

feature  positions 

right  detection 

non-ok 

dcp-ok 

1  8 

5  4 

86 

78 

7  1 

8  14 

86 

78 

7  1 

8  5 

86 

78 

4  2 

8  3 

86 
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3  2 
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86 
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8  1 
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3  4 
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8  2 
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78 

5  8 

13  1 

86 

78 

1  4 

13  8 

86 

78 

feature  number  =  {1, 4, 3, 

8, 9, 22,30, 35.51.111 

.210. 455. 457.459} 

condition:  if  (  (  (nn>^80)&(ww>^80)  ) 

1  ((nn> 

&  (ww>-79) )  ) 

table  4 

feature  positions 

right  detection 

non>ok 

dcp-ok 

7  5 

10  6 

80 

80 

6  4 

7  10 

80 

80 

7  4 

10  5 

80 

80 

Fig.  24.1:  Continued 


feature  number  =  {1, 3, 4, 8, 9, 22, 30, 37, 81,  111 

condition:  if  (  (  (nn>=80)  &  (ww>=80)  )  \  (  (nn>= 

,  452, 450, 459, 460} 

86)  &  (ww>=79)  )  ) 

table  1 

feature  positions 

riffht  detection 
non-ok  dcp-ok 

feature  positions 

right  detection 
non>ok  dcp-ok 
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12 

5 
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1 

2 

80 
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1 

6 
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Fig.  24.11:  Feature  combinations  by  'random  search’  -  polydat_3,  'm=5' 


52 


feature  number  =  {1, 4, 8, 

9, 22,30 

32, 37,  81, 452, 450, 459, 452} 

condition:  if((  (nn>^8})  &  (ww>= 

W)  1  ((m> 

II 

A 

00 

11 

table  2 

feature  nositions 

right  detection 

Don-ok  dcp-ok 

1  6 

4  10 

86 

80 

6  4 

1  10 

86 

80 

1  12 

3  10 

86 

80 

1  12 

13  14 

86 

80 

3  6 

1  10 

86 

80 

6  10 

5  1 

86 

80 

4  6 

10  1 

86 

80 

10  3 

1  6 

86 

80 

3  12 

10  1 

86 

80 

1  12 

10  5 

86 

80 

10  12 

1  14 

86 

80 

Fig.  24.11:  Continued 


After  running  similar  simulations  for  different  w's  with  randomly  chosen  features  from  the 
pool  of  the  aforementioned  five'’^  groups,  I  started  a  sequence  of  pseudo-exhaustive 
searches  with  those  features  from  which  we  received  good  results  by  random  search. 

For  this  sequence  of  simulations  the  parameter  m  was  set  equal  to  5.  We  started  with 
four-tuple  combinations  out  of  a  pool  of  14  features  (4/14).  We  then  gradually  increased 
the  number  of  the  features  -  within  the  tuple  and  the  pool  -  up  to  8/22.  To  run  the 
simulation  with  this  final  setting,  we  needed  a  computation  time  of  several  weeks. 

In  the  following  figures,  you  see  an  example  for  one  of  the  best  4-tuple  results  we 
obtained  for  the  polydat_3; 

4-tuple  combination:  81  &  111  &  450  &  452'’^. 

dimension:  polygraph  session. 

correct  detection  rate:  84%  for  non-deceptive  and  86%  for  deceptive  files. 

dimension:  polygraph  examination'*^  -  containing  1  to  4  sessions. 

correct  detection  rate:  89%  for  non-deceptive  and  94%  for  deceptive  files. 

dimension:  polygraph  examinations  with  more  than  two  sessions. 

detection  rate:  100%. 

^^See  Fig.  8  for  four  of  them  and  page  25  for  the  additional  fifth  one. 

'‘^For  information  about  the  exact  meaning  of  these  feature  numbers,  see  Fig.41. 

^See  "Evaluation  strategy"  in  chapter  3. 1.3.4. 
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Fig.25:  Defuzzified  results  for 
[81-111-450-4521  feature  combination 


54 


0.5555 

1.0000 

0.5692 

1.0000 

0.5650 

1.0000 

1 

0.4418 

0 

0.6468 

1.0000 

0.5009 

1.0000 

1 

0.5593 

1.0000 

0.5596 

1.0000 

0.4109 

0 

1 

0.6002 

1.0000 

0.5550 

1.0000 

0.5148 

1.0000 

1 

0.5964 

1.0000 

0.6112 

1.0000 

0.6224 

1.0000 

1 

0.7130 

1.0000 

0.5834 

1.0000 

0.5844 

1.0000 

1 

0.5472 

1.0000 

0.5758 

1.0000 

0.5924 

1.0000 

1 

0.5879 

1.0000 

0.6284 

1.0000 

0.6078 

1.0000 

1 

0.3902 

0 

0.5399 

1.0000 

0.4636 

0 

•0 

misclustered 

deceptive  files 
polydat_3 
m^5 


Uik  ( 

defuzzification  ner 

session 

test 

0.6374 

1.0000 

0.5389 

1.0000 

0.5094 

1.0000 

0.5696 

1.0000 

0.4185 

0 

0.5057 

1.0000 

— - 1 

0.5508 

1.0000 

0.5237 

1.0000 

0.5533 

1.0000 

0.5878 

1.0000 

0.5941 

1.0000 

1 

0.4533 

0 

0.5383 

1.0000 

0.5316 

1.0000 

....  1 

0.5452 

1.0000 

0.5266 

1.0000 

0.3128 

0 

....  1 

0.5068 

1.0000 

0.5735 

1.0000 

0.6276 

1.0000 

....  1 

0.5504 

1.0000 

0.5706 

1.0000 

0.5542 

1.0000 

_ 1 

Fig.25:  Continued 
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4. 1.2. 2,  Results  of  the  genetic  method: 


Simultaneously  to  the  aforementioned  sequence  of  searches,  I  started  with  a  compromise 
between  the  random  and  the  pseudo-exhaustive  search  method;  i.e.  the  genetic  alternative. 
I  decided  to  use  this  method  in  two  different  ways: 

1 .  In  order  to  increase  the  number  of  potentially  good  features  in  the  pool,  I 
initialized  the  genetic  code  with  up  to  50  features  from  which  (in  different 
simulations)  4-,  6-,  8-tuple  combinations  were  made. 

2.  In  order  to  accelerate  the  search,  but  process  the  data  more  exhaustively, 

I  decided  to  use  the  genetic  code  only  for  the  best  features  from  random 
and  pseudo-exhaustive  simulations  and  narrow  the  feature  pool  to  these 
30  selected  features.  In  this  simulation,  15-tuple  combinations  were  made. 


Recall  that  having  30  or  50  features  in  the  pool  makes  a  big  computation  difference.  For 
example,  choosing  exhaustively  8-tuples  out  of  50  or  30  features  makes  a  difference  of 
following  number  of  computations: 


^50^ 


^30^ 


50! _ 30! 

8!(50-8)!  8!(30-8)! 


5-10* 


In  the  first  part  of  the  genetic  search  -  as  expected  -  we  had  similar  problems  as  scientists 
have  with  the  theory  of  evolution  as  the  cause  of  our  being^^  The  only  way  we  could  get 
the  following  good  results  was  the  continuous  manipulating  of  the  evolution  process  -  by 
changing  parameters  (like  mutation  rate),  features  (=genes)  and  feature  numbers 
(=population  size  and  also  number  of  genes  in  one  chromosome),  or  by  starting  again  if 
the  simulation  began  with  a  very  low  detection  rate  (=average  fitness).  In  spite  of  these 
manipulations  the  first  version  of  the  genetic  search  took  a  simulation  time  of  over  two 
months  of  continuous  computation.  Without  the  constant  controlling  process  over  this 
genetic  system  the  evolution  (by  chance  as  it  is  its  nature)  could  have  hardly  provided  any 
appropriate  improvement^.  As  a  result  we  obtained  12  (see  Fig.26)  8-tuples  combination 


^^Further  discussion  about  "evolution  vs.  creation"  would  break  up  the  limitations  of  this  project;  For 
interested  readers  I  recommend  the  following  references:  |Morrisl987]  [Johnsonl991J. 

^For  example,  one  of  the  uncontrolled  simulation  for  polydat_l  was  stopped  after  561  generations 
providing  no  particular  results. 
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with  an  average  of  85%  correct  detection  rate  for  polydat_3  similar  to  the  results  of  the  4- 
tuple  combination  mentioned  in  chapter  4. 1.2.1.  We  also  obtained  3  outstanding  (86% 
correct  detection  rate)  individuals  within  three  different  generations  (population  size  of 
200  to  300,  total  number  of  generation  1000,  polydat_3). 


feature  numbers  of  the  best 
8-tuple  combinations 

8.30.81.81.111.363.458.482 
9,37,81,111  ,  111,449,458,460 

9.37.111.111.449.457.457.482 
9, 37,  111 ,  111  ,358,449,457,458 
9,37,  111,  111,235,449,457,460 
37,79,  111 ,  111 ,  197,358,449,457 
37,  111 ,  111  ,  197,449,457,460,460 
37  ,  111  ,  111 ,  111 ,  235 , 358 , 457 , 458 
37  ,  111  ,  111 ,  235 , 235 , 449 , 453 , 457 
37  ,  111 ,  111  ,  197 , 358 , 361 , 458 , 460 

37 . 81 . 1 11 . 235 . 235 . 363 . 450 . 453 

37.81. 111.235.235.359.450.453 

37,79,  111 ,  111 ,  197 , 235 , 449 , 457 
37  ,  111 ,  111 ,  235 , 235 , 453 , 457 , 460 
37,  111 ,  111 ,  197,235,452,457,460 


correct  detection  rate 


ndcp 

dcp 

84 

86 

84 

86 

84 

86 

84 

86 

84 

86 

84 

86 

84 

86 

84 

86 

84 

86 

84 

86 

86 

84 

86 

84 

86 

86 

86 

86 

86 

86 

ndcp:  non-deceptive  files 
dcp:  deceptive  files 
data:  polydatS 

Fig.26:  Results  of  the  first  version  of  the  genetic  search 

Concerning  the  defuzzified  results,  all  the  combinations  with  85%  correct  detection  rate 
show  similar  structure  as  depicted  in  Fig.25.  The  three  best  8-tuple  combinations  (86% 
correct  detection  rate)  cluster  the  data  exactly  in  the  same  groups  as  shown  in  the 
following  figure. 
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0.4289 
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0 

0.5548 
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0.4135 
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non-decevtive  files 
polydat_S 
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Fig.27:  Defuzzified  results  for 
[37-111-111-197-235-452-457-460]  feature  combination 
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polydat_3 
m=5 


Fig.27:  Continued 


The  followings  are  the  clustering  results  of  the  best  8-tuple  combinations  for  polydat_3. 


dimension:  polygraph  session^*’. 

detection  rate:  86%  for  both  non-deceptive  and  deceptive  files. 


dimension:  polygraph  examination  -  containing  1  to  4  sessions. 

rnrrf>r.t  detection  rate:  94%  for  both  non-deceptive  and  deceptive  files. 


dimension:  polygraph  examinations  with  more  than  two  sessions 

detection  rate:  97%. 


In  the  second  part  of  the  genetic  search  as  we  fed  the  evolution  process  with  the  best 
features,  we  obtained  after  about  3  weeks  of  continuous  simulation  the  following  results: 


twelve  15-tuple  combinations;  (the  features  in  each  tuple  are  ordered  vertically) 


37 

11 

8 

8 

37 

30 

11 

30 

11 

11 

11 

111 

11 

11 

37 

81 

32 

30 

32 

30 

30 

30 
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36 

37 

50 

81 

32 

32 

39 

32 

32 

32 
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36 

111 

79 

81 

32 

39 

81 

39 

39 
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37 
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36 

81 

81 

81 

79 
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81 

81 

81 
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361 
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197 
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458 
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450 
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84 

84  :non-deceptive  files 

86 

86 

86 

86 

86 

86 

86 

86 

86 

86 

86  ideceptive  files 

_ _ _ _ 

polydat_3, 

Fig.28:  Results  of  the  second  version  of  the  genetic  search 


47See  "Evaluation  strategy"  in  chapter  3. 1.3.4. 
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Fig.29:  Average  fitness  of  each  generation 
provided  by  the  second  version  of  the  genetic  search 


As  you  see  in  this  figure,  the  average  fitness  (from  all  the  chromosomes  within  a 
generation)  increases  over  the  period  of  time.  It  then  approaches  a  local  asymptote  which 
represents  a  local  error  minimum.  By  increasing  the  mutation  rate  after  the  150th 
generation,  we  could  avoid  being  stuck  in  that  local  minimum  for  further  development. 
This  higher  mutation  rate  helped  the  evolution  process  getting  a  1%  better  average  fitness 
per  generation  for  the  rest  of  the  simulation. 

Our  hope  for  this  simulation  was  to  get  outstanding  chromosomes  with  a  very  high  fitness 
simultaneously  to  the  increasing  process  of  the  average  fitness  per  generation.  However, 
the  outstanding  chromosomes  appeared  unsystematically  in  different  generations  and  not 
at  the  end.  In  fact,  most  ofthem'*^  belong  to  the  first  part  of  this  evolution. 

'•^See  the  begining  of  this  chapter  for  more  details. 

^^See  Fig.28  for  the  best  feature  combinations. 
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4. 1.2.3.  Final  results  of  FCM-  A  comparison  betyveen  all  three polydatj's: 

All  the  aforementioned  results  belong  to  the  data  set  polydat_3,  and  all  the  three  methods, 
(1)  previous  researches  using  the  fuzzy  K-nearest  neighbor  (KNN)  classifier,  (2)  the  LMS 
fuzzy  adaptive  filter  and  also  (3)  the  fuzzy-c-means  algorithm  show  that  the  data  structure 
within  the  polydat_3  is  better  to  cluster  or  classify  than  the  other  two  sets. 

As  it  is  the  nature  of  a  clustering  versus  a  classifying  method,  I  did  not  set  the  highest 
priority  on  finding  the  same  best  features  for  all  three  polydatj's,  but  for  each  of  them 
individually.  After  finding  those  best  combinations,  I  then  compared  the  results  and  tested 
the  consistency  of  the  features  (see  Fig.  33,  34,  35). 


Using  either  sessions  or  examinations^^  as  the  counting  dimension  the  best  results  for  each 
polydatj  individually  are  shown  in  the  following  figures. 


data 

average  correct  detection  rate 

polydat_l 

81% 

polydat_2 

79% 

polydat_3 

86% 

Fig.30:  Clustering  results  using  individual  features 
(using  sessions  as  the  counting  dimension) 


data 

average  correct  detection  rate 

polydat_l 

91% 

polydat_2 

82% 

polydat_3 

94% 

Fig.31;  Clustering  results  using  individual  features 
(using  examinations  as  the  counting  dimension) 

^°See  "Evaluation  suategy"  in  chapter  3.I.3.4. 
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data 

polydat_l 

polydat_2 

polydat_3 


average  correct  detection  rate 

93% 

87% 

97% 


Fig.32:  Clustering  results  using  individual  features 
(counting  only  those  examinations  with  more  than  two  sessions) 

In  the  following  figures,  a  comparison  between  the  three  polydatj's  were  made  using  the 
best  feature  combination  for  one  of  the  polydat_i's  at  a  time  and  testing  it  for  the  other 
two  ones.  As  you  will  see,  the  best  result^*  -  while  taking  the  same  features  for  each 
polydatj  -  is  79.7%  for  the  feature  combination^^  [9,  30,  81,  197,  478,  111],  and  in 
average  79.3%. 


feature  tuple 

37,  79,  1 1 1,  1 1 1,  197,  235,  449,  457 
37,  1 1 1,  1 1 1,  197,  235,  452, 457,  460 
37,  1 1 1,  1 1 1,  235,  235,  453,  457,  460 

30.81.81,  111,  197, 458 

9.30.81,  111,  197,  458 

8,37,  50,  79,  111,  111,235,  235,... 
358,  452,  453, 478,  478,  478,  482 


poIydat_i 

i=2 

i=l 

86% 

77% 

75% 

86% 

77% 

75% 

86% 

77% 

74% 

85% 

79% 

73% 

85% 

79% 

73% 

85% 

76% 

76% 

Fig.33:  Comparison  #1  (dimension:  sessions) 

(taking  some  of  the  best  polydat_3  feature  tuples  and  testing  it  for  the  others) 

For  the  exact  labels  of  this  feature  numbers  see  appendix,  Fig.42. 


^^With  polygraph  sessions  as  the  counting  dimension. 
^^See  Fig.35,  "Comparison  #3". 
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polydat_i 

feature  tuple 

ti 

i=2 

i=3 

9,  30,  30,  39,  235,  450 

80% 

75% 

81% 

30,  30,  39,  50,  235,  450 

80% 

75% 

81% 

30,  30,  39,  81,  235,  450 

80% 

75% 

81% 

30,  30,39,  197,  235,  450 

81% 

74% 

82% 

30,  30,  39,  235,  363,  450 

81% 

75% 

81% 

30,  30,  39,  235,  358, 450 

80% 

76% 

81% 

30,  30,  39,  235,  450,  458 

80% 

75% 

81% 

30,  30,  39,  235,  482,  450 

80% 

75% 

81% 

30,  30,  39,  235,  361,  450 

80% 

75% 

81% 

30,  30,  39,  235,  359,  450 

80% 

75% 

81% 

30,  30,  39,  235,  450,  457 

80% 

75% 

81% 

30,  39,  235,  363,  450,  482 

80% 

72% 

83% 

30,  39,  235,  363,  450,  478 

80% 

71% 

83% 

Fig.34:  Comparison  #2  (dimension:  sessions) 

(taking  some  of  the  best  polydat  l  feature  tuples  and  testing  it  for  the  others) 
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poIydat_i 

feature  tuple 

tl 

i=l 

i=3 

9,30,81,  197,478,  111 

79% 

75% 

85% 

9,30,  50,81,  197,  111 

79% 

74% 

85% 

9,  30,81,358,  197,  111 

79% 

74% 

85% 

9,30,  81,359, 197,  111 

79% 

74% 

85% 

9,30,81,  197, 457,  111 

79% 

74% 

85% 

30,81,  105,  111,  197,358 

79% 

74% 

84% 

30,  81,  105,  111,  197,359 

79% 

74% 

84% 

30,  81,  105,  111,  197,  457 

79% 

74% 

85% 

30,81,  105,  111,  197,  459 

79% 

74% 

84% 

30,81,  111,  197,358,359 

79% 

74% 

85% 

30,81,  111,  197,358,  456 

79% 

74% 

85% 

30,81,  111,  197,358,  457 

79% 

74% 

85% 

30,81,  111,  197,358,  459 

79% 

74% 

85% 

30,81,  111,  197,  359, 456 

79% 

74% 

85% 

30,  81,  111,  197,  359,  457 

79% 

74% 

85% 

30,81,  111,  197,359,  459 

79% 

74% 

85% 

30,  81,  111,  197,  456,  457 

79% 

73% 

85% 

30,81,  111,  197,  456,  459 

79% 

74% 

85% 

30,81,  111,  197,  457,  459 

79% 

74% 

85% 

30,105,111,197,359,459 

79% 

74% 

84% 

30,  105,  111,  197,  456,  459 

79% 

74% 

84% 

30,  105,  111,  197,  457,  459 

79% 

74% 

85% 

30,  105,  111,  197,  456,  457 

78% 

74% 

85% 

30,  111,  197,358,359,  459 

78% 

74% 

85% 

30,111,197,358,456,459 

78% 

74% 

85% 

30,  111,  197,358,  457,  459 

78% 

74% 

85% 

30,  111,  197,  456,  457,  459 

78% 

74% 

85% 

Fig.35:  Comparison  #3  (dimension:  sessions) 

(taking  some  of  the  best  pofydat_2  feature  tuples  and  testing  it  for  the  others) 
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4.2.  LMS  fuzzy  adaptive  filter 


The  first  test  we  did,  was  to  find  the  performance  of  the  filter  before  any  training.  That  is, 
we  used  the  classifier  as  a  conventional  fuz2y  logic  system  designed  solely  based  on  the 
four  linguistic  rules  mentioned  above.  The  results  are  listed  in  the  following  table. 


correct  detection  rate  in 


polydat_i 

non-deceptive  files 

deceptive  files 

average 

i=i 

70% 

72% 

71% 

i=2 

70% 

76% 

73% 

i=3 

70% 

88% 

79% 

Fig.36;  Results  based  solely  on  4  aforementioned  linguistic  rules 

without  any  training 

Note  that  the  percentage  of  correct  recognition  for  non-deceptive  subjects  are  the  same 
for  polydat_l,  polydat_2,  and  polydat_3,  because  they  are  all  the  same  data”.  Also  note 
that  the  results  are  best  for  polydatj,  as  it  was  for  KNN  and  FCM.  This  may  be  partially 
due  to  polydat_3's  good  performance  in  general,  independent  of  the  classifying  schemes. 
We  believe  that  it  may  also  be  a  result  of  us  setting  up  the  linguistic  rules  by  having 
observed  polydat_3. 

However,  the  outcomes  for  polydat_l  and  polydat_2  are  good  enough  such  that  one  can 
be  sure  the  linguistic  rules  are  sufficiently  general  even  for  data  that  we  did  not  examine. 

As  mentioned  in  chapter  3.2.3,  we  then  tested  the  fuzzy  LMS  algorithm  trained  with 
twenty  training  data  (ten  deceptive  and  ten  non-deceptive)  and  again  with  seventy  training 
data  (thirty-five  deceptive  and  thirty-five  non-deceptive)  for  the  three  sets  of  data,  for  a 
total  of  six  tests.  Twenty  trials  were  performed  for  each  test,  and  the  system  was 
initialized  with  the  linguistic  rules  before  each  trial.  The  training  data  were  randomly 
chosen  for  each  trial,  and  the  rest  of  the  available  data  in  each  set  were  for  testing. 


53see  polygraph  files  on  chapter  6.2. 
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We  computed  the  percentage  of  correct  recognition  of  testing  data  for  each  trial, 
averaging  the  performance  for  deceptive  and  non-deceptive  subjects.  The  recognition  rate 
of  those  twenty  trials  are  averaged,  rounded  to  two  digits,  and  reported  in  the  following 
table.  The  sample  standard  deviations  are  also  shown. 


correct  detection  rate 


polydat_i 

version  #1 

version  #2 

i=l 

75%  (6%) 

73%  (2%) 

i=2 

74%  (7%) 

73%  (3%) 

i=3 

78%  (6%) 

79%  (2%) 

version  il;  70  training  &  30  testing  sessions 
version  #2:  20  training  &  80  testing  sessions 
(standard  deviation  in  parentheses) 

Fig.37:  Average  percentage  of  correct  detection  rate 
for  twenty  trials  of  each  test 

As  may  be  expected,  the  recogmtion  rate  improves  in  general  when  training  data  is  used, 
as  compared  to  the  results  of  the  untrained  system.  Also,  the  recognition  rate  is  typically 
higher  when  the  system  is  trained  with  more  data.  The  difference,  however,  is  not 
dramatic.  The  use  of  training  data  offers  small  incremental  improvements.  The  one 
exception  would  be  for  data  set  polydatj.  Here  more  training  data  seems  to  lower  the 
performance.  The  effect  is  probably  due  to  the  fact  that  the  initialization  of  the  reasoning 
rules  were  based  on  our  examination  of  polydat_3,  which  covered  all  100  data.  Yet  the 
training  algorithm  was  to  learn  only  a  subset  of  that,  so  it  was  handicapped  compared  to 
human  reasoning. 


Human  reasoning  may  also  be  better  in  this  case  because  the  training  algorithm  only 
attempts  to  optimize  the  system  in  the  least  mean  square  sense,  slightly  different  than  our 
ultimate  goal  of  maximizing  recognition  rate.  At  any  rate,  when  the  standard  deviation  is 
taken  into  account,  the  difference  in  recognition  rate  becomes  insignificant. 

Another  noticeable  difference  between  the  results  using  different  amounts  of  training 
samples  is  the  value  of  the  sample  standard  deviation.  A  large  number  of  testing  data  leads 
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to  a  small  standard  deviation.  Conversely,  a  small  amount  of  testing  data  leads  to  a  large 
standard  deviation.  This  confirms  what  we  intuitively  know;  the  average  percentage  of 
correct  recognition  is  more  accurate  when  a  large  amount  of  testing  data  is  available. 


The  above  observations  illustrate  a  practical  issue  in  using  many  adaptive  and  learning 
algorithms,  that  of  partitioning  a  limited  amount  of  data  into  training  and  testing  sets.  For 
most  algorithms,  too  much  data  in  training  and  little  in  testing  leaves  little  assurance  about 
the  performance  of  the  system.  On  the  other  hand,  too  much  data  in  testing  and  little  in 
training  assures  mediocre  performance  from  the  system. 

More  data  for  both  training  and  testing  would  help,  but  many  times  that  may  not  be 
available.  Fuzzy  logic  systems  mitigate  this  problem  by  exploiting  linguistic  information. 
Unlike  neural  networks  and  many  statistical  techniques,  which  are  completely  dependent 
on  numerical  data,  this  fuzzy  LMS  algorithm  uses  numerical  data  mainly  to  optimize  a 
good  fiizzy  system.  The  above  results  show  that,  given  good  initialization  of  the  reasoning 
rules,  the  system  can  perform  well  even  with  little  or  no  training  data.  This  robustness  is 
one  of  the  many  advantages  of  fuzzy  logic. 


4. 3.  Other  observations: 


During  this  project,  aside  from  the  results  and  conclusions  we  were  looking  for,  we  also 
obtained  several  side  results.  In  this  passage,  I  will  mention  some  of  the  interesting 
observations  we  made. 

1.  As  mentioned  before,  the  fiizzy-c-means  (FCM)  algorithm  is  initialized  by  random 
chosen  membership  values  which  will  be  modified  and  optimized  during  the  iterative 
process.  Thus,  FCM  algorithm  is  almost  independent  of  the  initial  membership  values. 
During  our  testing  process,  we  noticed  that  the  FCM  algorithm  is  not  absolutely 
independent  of  the  initial  values.  Thus,  it  is  possible  that 

•  the  algorithm  may  run  into  different  local  minima  or 

♦  because  of  its  unsupervised  nature,  the  algorithm  may  switch  the  clusters, 
i.e.  if  -  depending  on  our  interpretation  -  the  first  cluster  represents  the  non- 
deceptive  and  the  second  one  the  deceptive  files,  it  might  be  the  opposite 
while  using  other  initial  random  values. 

To  avoid  any  misinterpretations,  I  decided  to  create  two  sets  of  random  membership 
values  (for  c=2  and  c=3)  and  save  them  as  fixed  initialization  values  for  any  further 
simulations.  In  the  following  figure, '+'  represents  the  non-deceptive,  the  decptive  files; 


Fig.38:  Fixed  initial  random  membership  values  for  c=2 
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2.  "Outlier  effect": 

In  the  real  world  of  using  an  automated  polygraph  system  as  suggested  in  this  project,  we 
have  to  keep  in  mind  the  existence  of  the  outlier  effect.  This  occurs,  for  instance,  when  a 
non-deceptive  person  (=  membership  value  between  zero  and  0.5)  becomes  misclustered 
in  a  deceptive  data  space  with  a  very  high  membership  value  close  to  one.  In  other  words, 
if  a  normal  non-deceptive  person  gets  labeled  as  very  deceptive,  or  vice-versa. 

We  noticed  this  phenomenon  in  both  clustering  and  classifying  algorithms^'*.  We  also 
noticed  that  by  making  the  system  "fuzzier"  -  e.g.  higher  m  or/and  c  for  FCM  -  as 
expected,  the  outlier  effect  can  be  reduced,  but  not  eliminated  though. 

3.  "Performance  limitations": 

There  seem  to  be  a  limit  in  recognition  rate  using  the  features  available  by  both  fuzzy 
algorithms  used  in  this  project  and  also  by  fuzzy  k-nearest  neighbor  algorithm  used  in 
previous  works  [Layeghi  1993,1]  [Dastmalchil993]  for  all  the  available  polydatj's.  There 
may  also  be  psychophysiological  limitation  on  the  recognition  rate.  However,  polydat_3 
provided,  independent  of  all  the  three  algorithms,  the  best  results  compared  to  the  other 
two  polydatj's. 


^‘‘See  also  "Epilogue”. 
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4.3.  A  COMPARISON 


BETWEEN  THE  THREE  FUZZY  ALGORTHMS  USED  IN  THIS 
AND  THE  PREVIOUS  PROJECT 

(FUZZY-C-MEANS,  LMS  FUZZY  ADAPTIVE  FILTER  AND  FUZZY  K-NEAREST  NEIGHBOR) 


The  fuzzy  LMS  system  is  unique  in  its  application  of  linguistic  knowledge.  As  mentioned 
earlier,  the  use  of  linguistic  knowledge  ensures  the  robustness  of  the  fiizzy  system.  The 
use  of  linguistic  information  also  ameliorates  the  problem  of  not  having  enough  reliable 
numerical  data.  Unlike  classification  schemes  such  as  the  K-Nearest  Neighbor,  the  fuzzy 
LMS  algorithm  is  not  entirely  dependent  on  numerical  data. 

When  applied  to  pattern  recognition,  fuzzy  logic  systems  can  be  set  up  to  perform  like 
KNN  systems.  In  KNN  systems,  numerical  data  of  known  class  patterns  are  set  up  to 
estimate  the  probability  density  distribution  of  the  classes.  The  probabilities  of  new  data 
points  belonging  to  the  different  classes  are  then  computed  based  on  such  distribution. 
Data  points  around  known  class  samples  are  then  classified  into  the  same  class  with  a 
higher  probability.  The  fuzzy-KNN  algorithm  modifies  the  classical  KNN  algorithm  by 
taking  into  account  the  distance  between  the  data  point  and  the  known  class  patterns  when 
estimating  the  probability.  Conceptually  this  is  similar  to  setting  up  clusters  around  all 
known  class  samples  and  calculating  the  degree  of  belonging  of  new  data  points  in  the 
different  types  of  clusters.  Other  than  the  exact  mathematical  equations,  that  description 
fits  a  fiizzy  adaptive  system  where  each  rule  corresponds  to  a  known  class  pattern  and  the 
size  of  the  clusters  is  the  same  for  all  rules. 

However,  fiizzy  adaptive  systems  give  up  some  of  the  nice  theoretical  understandings  of 
the  KNN  systems  but  gain  some  practical  advantages.  The  number  of  rules  required  are 
usually  much  smaller  than  the  number  of  known  samples.  Fuzzy  logic  can  usually  exploit 
that  to  reduce  system  complexity. 

Furthermore,  the  system  complexity  for  a  fiizzy  adaptive  system  stays  the  same  even  as 
new  information  are  available.  This  is  partly  a  result  of  the  way  this  algorithm  adapt 
continuously;  new  information  are  learned  as  old  ones  are  forgotten.  The  fiizzy  LMS 
learning  technique  is  like  backpropagation,  a  popular  neural  network  training  technique. 
However,  the  fiizzy  LMS  learning  algorithm  requires  few  epochs  for  training.  In  all  our 
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trials  the  maximum  recognition  rates  for  testing  data  peaked  in  less  than  thirty  epochs. 
About  95%  of  them  peaked  in  less  than  twenty  epochs”.  This  is  a  few  orders  of 
magnitude  less  than  most  applications  of  backpropagation.  In  many  cases  the  peaks 
occurred  before  any  training;  that  is,  the  system  uses  only  linguistic  rules.  Here  the  use  of 
expert  knowledge  speeds  up  the  training  of  the  system. 

The  fuzzy-c-means  algorithm,  unlike  fuzzy  LMS,  is  an  unsupervised  clustering  algorithm. 
Given  a  set  of  data,  FCM  looks  for  a  (usually)  predetermined  number  of  clusters  within 
the  data  points.  It  does  not  use  any  knowledge  about  the  correct,  or  desired  classification 
of  any  of  the  elements.  The  algorithm  only  minimizes  an  objective  function,  which  is  the 
sum  of  a  function  of  the  data  points'  membership  values  and  the  distances  between  the 
data  points  and  the  clusters'  centers. 


FCM  operates  like  a  black  box;  given  some  data,  the  algorithm  automatically  computes 
the  results”.  This  presents  the  advantage  that  different  sets  of  data  using  different  features 
can  be  tested  in  a  routine  manner.  FCM  also  presents  a  way  to  normalize  the  different 
dimensions  of  the  data,  just  like  the  use  of  sigma  in  the  fiizzy  LMS  algorithm.  However, 
unlike  fiizzy  LMS,  FCM  does  not  present  a  method  to  find  the  optimal  way  for  such 
normalization. 

The  fuzzy  LMS  algorithm,  however,  does  pose  some  potential  problems  of  its  own.  The 
use  of  expert  knowledge,  while  a  benefit  in  some  senses,  may  not  be  always 
straightforward.  For  example,  in  our  project  we  did  not  have  any  specific  knowledge 
about  the  polygraphy  itself  Whatever  we  learned,  we  learned  by  looking  at  numerical 
data.  As  we  tried  to  find  more  complicated  patterns,  patterns  involving  three,  four,  or 
more  features,  the  analysis  became  more  difficult.  Naturally  one  wishes  to  automate  this 
process.  If  we  do  not  rely  on  some  learning  procedures,  however,  rules  cannot  be 
automatically  found  for  the  fiizzy  system.  Much  research  also  needs  to  be  done  to 
understand  the  fiizzy  LMS  algorithm's  learning  dynamics.  While  the  same  method, 
gradient  descent,  is  used  on  both  backpropagation  and  the  fiizzy  LMS  algorithm,  the 
general  shapes  of  the  error  surface  between  the  two  are  different.  In  backpropagation,  all 
the  parameters  have  the  same  range  and  lie  in  an  uniform  neural  network  structure.  In  the 
fuzzy  LMS  algorithm,  the  parameters  can  have  different  ranges  and  lie  a  fiizzy  logic 


^^However,  we  ran  every  trial  to  forty  epochs  to  ensure  that  there  is  no  "false"  peak. 
^^Our  job  is  basically  to  adjust  the  parameters. 
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structure  that  is  not  completely  uniform.  The  effects  of  such  differences  on  the  shape  of 
the  error  surface  and  the  learning  dynamic  are  unknown. 

In  the  following,  I  will  mention  again  some  of  the  results  we  obtained  by  using  different 
fiizzy  clustering  or  classifying  algorithms.  Recall  that  also  the  searching  strategies  to  find 
the  best  features  -and  feature  combinations-  were  different  for  each  of  the  aforementioned 

algorithms*"^. 


poIydat_i 

tl 

fu2^-c-means** 

91% 

82% 

94% 

fuzzy-c-means*’ 

93% 

87% 

97% 

fuzzy  K-nearest-neighbor 

86% 

80% 

91% 

LMS  fuzzy  adaptive  filter 

81% 

83% 

83% 

fuzzy-c-means^ 

81% 

79% 

86% 

The  results  are  rounded. 


Fig.39:  Comparison  between  different  fuzzy  algorithms 
used  for  polygraph  classification  in  this  and  in  the  previous  research 

The  results  of  our  fuzzy  LMS  system,  while  impressive  for  such  a  simple  set-up,  are  not 
comparable  to  the  results  of  the  same  project  using  other  systems.  We  believe  that  the 
recognition  rate  will  increase  for  few  percentage  points  by  using  the  suggestions  in  chapter 

5.1. 


57see  the  following  chapters  3.1.3.1, 3.1.3.2.1  -  4  for  the  searching  strategies  used  for  the  FCM, 

chapter  3.2.1  for  the  visual  inspection  used  for  the  LMS  system, 

and  chapter  III.3.3.  in  [Uyeghil993,l]  for  the  methods  used  for  the  KNN. 

**FCM  using  examinations  as  the  counting  dimension  (see  chapter  4.I.2.3.  Md  Fig.31). 

*^e  same  as  above  but  counting  those  examinations  with  more  than  2  sessions  (see  Fig.32). 

60Since  we  took  35  out  of  50  available  non-deceptive  sessions  for  training  the  LMS  filter,  it  would  ^ 
meaningless  to  evaluate  this  algorithm  by  examinations  as  the  counting  dimension.  Yet,  in  order  to  make 
it  comparable  to  the  other  algorithms,  the  results  of  the  FCM  with  sessions  as  the  counting  dimension  are 

also  shown. 
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§5.  FUTURE  STEPS  AND  SUGGESTIONS 


5.1.  The  algorithms: 

As  mentioned  earlier  in  chapter  2.2.3.  about  the  fuzzy-c-means  algorithm,  the  performance 
of  this  clustering  model  is  influenced  by  the  choice  of  various  parameters.  In  this  project,  I 
tried  to  find  the  optimum  values  of  the  majority  of  them.  However,  there  are  several  other 
points  which  should  be  studied  more  comprehensively:  They  are 

•  the  initial  cluster  centers, 

•  the  order  in  which  the  samples  are  taken  as  input, 

•  the  choice  of  distance  measure, 

•  the  termination  criteria  and 

•  the  geometrical  properties  of  the  data. 


Most  imprtantly,  more  information  about  the  geometrical  arrangement  of  the  data  points 
and  the  appropriate  choice  of  the  norm  could  help  us  improve  the  clustering  algorithm. 
There  are  several  suggestions  in  [Bezdekl981]  [Bezdekl992]  [IIScorpl993]  for  a  better 
understanding  of  the  algorithm's  dynamics  and  for  making  systematic  decisions  concerning 
different  types  of  distance  norms  and  elliptical  cluster  shapes. 

For  future  studies,  I  highly  recommend  a  deeper  investigation  of  our  clustering  algorithm 
by  setting  c=3  and  trying  defuzzification  thresholds  other  than  0.5. 

In  this  project,  we  decided  to  systematically  test  the  FCM  algorithm  with  different  values 
of  m  to  find  its  optimum.  For  additional  (and  more  theoretical)  investigations,  I  suggest 
[Choel992]  as  an  introductory  step.  It  may  be  also  helpful  to  use  different  values  of  m  for 
different  sessions  simultaneously,  while  looking  for  the  most  realistic  clusters  within  the 
entire  session  space. 


An  exciting  additional  investigation  would  be  a  new  polydat  made  up  of  the  best  clustered 
sessions  of  our  three  polydatj's  as  a  reference  for  any  further  clustering  process.  By  doing 
this  we  could  give  the  algorithm  a  better  chance  to  cluster  correctly  even  the  critical 
sessions. 
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Concerning  the  LMS  adaptive  algorithm,  one  may  investigate  the  effect  of  changing  the 
learning  factor;  throughout  our  experiment  it  remained  at  0.005.  Upon  observing  the 
quickness  of  learning  in  our  testing,  we  believe  the  learning  factor  can  be  decreased  in  the 
future. 

We  also  believe  that  there  should  not  be  just  one  but  at  least  three  different  learning 
factors:  one  for  the  cfs,  one  for  the  0's,  and  one  for  the  x,'s;  because  these  three  types  of 
parameters  lie  in  a  very  irregular  parameter  space,  unlike  that  of  backpropagation  where 
all  parameters  lie  in  a  more  or  less  uniform  parameter  space. 

For  illustration,  the  three  types  of  parameters  comapred  to  one  another  have  very  different 
numerical  ranges.  Conceptually  speaking,  a  parameter  with  a  large  range  of  movement 
should  generally  have  a  larger  learning  factor  than  one  with  a  smaller  range  of  movement. 
However,  the  gradient  and  the  general  shape  of  the  error  surface  would  also  affect  the 
value  of  the  learning  factors.  It  is  possible  that  with  a  constant  learning  factor,  a  factor  that 
is  too  large  for  one  type  of  parameter  -  one  that  causes  oscillation  for  that  parameter  -  may 
be  too  small  for  another  type  of  parameter  and  effects  little  change.  That  is,  some 
parameters  become  more  willing  to  adapt  while  others  hesitate  to  change. 

Setting  up  separate  learning  factors  for  the  different  types  of  parameters  should  eliminate 
this  problem.  However,  choosing  a  learning  factor  is  still  a  complex  trial-and-error  task, 
and  having  more  learning  factors  to  deal  with  requires  more  sophisticated  understanding 
of  the  learning  dynamics  we  possess.  Plots  of  the  mean  squared  error  of  two  sets  of 
randomly  chosen  training  data  suggest  that  there  are  noticeable  points  where  the  rate  of 
decrease  dramatically  changes  (see  the  following  figure). 
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Two  cases  of  the  mean  squared  error  of  training  data  as  the  system  '"learns. 


Fig.40:  The  influence  of  the  learning  factor 

More  rules  and  features  should  be  added  to  improve  this  LMS  system.  As  the  complexity 
of  the  system  grows,  however,  the  design  will  depend  more  on  the  learning  algorithm  than 
on  heuristic  knowledge.  This  requires  much  more  understanding  of  the  learning  dynamics. 
Preliminary  testing  with  three  features  and  eight  rules  shows  little  improvement  in 
recognition  rate.  Obviously  many  additional  studies  need  to  be  done  in  this  case. 

As  mentioned  in  chapter  "Setting  Linguistic  Rules",  for  future  investigations  one  may  also 
experiment  with  different  decision  thresholds  for  detemuning  deception  and  nondeception. 
However,  the  benefit,  if  any,  of  this  is  not  clear.  One  may  also  experiment  with  mapping 
the  fiizzy  output  to  a  confidence  value  in  addition  to  just  a  deception/nondeception 
decision.  This  may  be  more  helpful  in  practical  situations.  One  should  also  test  the 
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algorithm  with  random  initializations;  that  is,  without  using  any  expert  knowledge.  It 
would  be  interesting  to  compare  the  training  time,  performance,  and  robustness  of  that 
system  to  the  present  one. 

Fuzzy  logic  systems  promote  rapid  development  of  robust,  simple,  and  reliable  systems. 
Our  project  validated  that  point.  Some  of  the  main  problems  with  designing  iroditioncil 
fuzzy  logic  systems,  however,  are  their  dependence  on  heuristic  information,  their  lack  of 
design  automation  and  their  unproven  ability  to  reach  an  optimal  solution  by  linguistic 
rules  alone.  Our  use  of  the  LMS  learning  algorithm  attempts  to  solve  such  problems.  The 
learning  algorithm  did  offer  small,  incremental  improvements,  but  we  believe  that  the 
learning  algorithm  has  not  yet  been  explored  fiilly.  A  better  understanding  of  the  learning 
dynamics  would  offer  more  insight  into  improving  the  system. 

In  future  works,  one  may  also  consider  other  strategies  which  use  irrelevant  questions, 
(see  Fig. 7).  These  questions  could  be  easily  exploited  for  normalizing  the  data  and  making 
it  independent  of  individual  charateristics  of  the  tested  subjects. 

5.2.  The  polygraph  examination: 

As  expected^i,  and  eventually  proven^,  our  clustering  system  can  provide  an  up  to  12% 
more  correct  detection  rate  by  using  the  dependency  between  the  polygraph  sessions. 
Therefore,  I  recommend  recording  at  least  three  -  ideally  five  -  test  sessions  with  different 
a  order  of  questions  per  each  examinations.  Thus,  in  cases  where  some  sessions  within  an 
examination  are  clustered  incorrectly,  the  algorithm  can  easily  ignore  the  minority  and  find 
the  right  cluster  according  to  the  correctly  clustered  majority. 

One  may  also  consider  other  time  frames,  and  emphasize  those  features  which  enabled  us 
to  cluster  the  data  the  best.  It  may  also  be  helpful  to  mark  the  data  of  female  and  male 
subjects,  or  to  consider  them  differently,  since  the  ranges  of  the  biophysical  reactions  are 
not  in  the  same  numerical  spaces. 

Ultimately,  an  automated  polygraph  system  which  uses  the  aforementioned  strategies  to 
distinguish  between  truth  and  deception  should  have  a  built-in  feature  extraction  tool 
which  can  directly  feed  the  needed  data  to  the  algorithm. 


®^See  chapter  3. 1.3.4. 
62see  chapter  4. 1.2.3. 
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§6.  APPENDIX 


Fig.41:  List  of  labels  of  all  the  features  used  in  this  project 


Fig.41:  Continued 


Non-dcceptive 

QQ8R9OIO.011 

QQ8R9OIO.021 

QQ8R9OIO.031 

QQ95LU1T.011 

QQ95LU1T.021 

QQ95LU1T.031 

QQAURNUS.021 

QQAURNUS.031 

QQAV53P6.011 

QQAV53P6.021 

QQAV53P6.031 

QQBQ4SHI.011 

QQBQ4SHI.021 

QQBQ4SHI.031 

QQBSS7WT.011 

QQBSS7WT.021 

QQBSS7WT.031 

QQ70XM60.021 

QQ7RH0RO.011 

QQ7RH0RO.021 

QQ7RH0RO.031 

QQ7R51P9.011 

QQ7R51P9.021 

QQ7R51P9.031 

QQ9TDSP3.011 

QQ9TDSP3.021 

QQ9TDSP3.031 

QQA8OWOI.011 

QQA8OWOI.021 

QQA8OWOI.031 

QQBT22O6.011 

QQBT22O6.021 

QQBT22O6.031 

QQBO9O_9.011 

QQBO9O_9.021 

QQBO9O_9.031 

QQBC7PP6.011 

QQBC7PP6,021 

QQBC7PP6.031 

QQCHCK_O.011 

QQCHCK_O.021 

QQCHCK_O.031 

QQCDTKPO.Oll 

QQCDTKP0.031 

QQCDTKP0.041 

QQCM5Y56.011 

QQCQQT8Y.011 

QQCQQT8Y.021 

QQCQQT8Y.031 

QQCQQT8Y.041 


Deceptive  1 _ 

QQ4Q1O83.011 

QQ4Q1O83.021 

QQ4Q1O83.031 

QQ4Q3MDC.011 

QQ4Q3MDC.021 

QQ4Q3MDC.031 

QQ51DE36.011 

QQ51DE36.021 

QQ51DE36.041 

QQ6RQGH6.011 

QQ6RQGH6.021 

QQ6RQGH6.031 

QQ6RQGH6.041 

QQ6T711O.011 

QQ6T71 10.021 

QQ6T7 110.031 

OO6Z59IG.011 

OO6Z59IG.021 

OO6Z59IG.031 

OO7PP9B9.011 

OO7PP9B9.021 

OO7PP9B9.031 

007PDU1X.011 

OO7PDU1X.021 

OO7PDU1X.031 

007_PIPF.011 

007_PIPF.021 

007_PIPF.031 

007_JT70.011 

007_JT70.021 

OO7_n’70.031 

OO738DYX.011 

OO738DYX.021 

OO738DYX.031 

OO75ULP9.011 

OO75ULP9.021 

OO75ULP9.031 

0079_EYF.011 

0079_EYF.021 

0079_EYF.031 

007BGDML.011 

OO7BGDML.021 

OO7BGDML.031 

OO7ETC8I.011 

OQ7ETC8I.021 

007ETC8I.031 

0Q7JA0CS.011 

OO7JAOCS.021 

OO7JAOCS.031 

OO7LX5O0.011 


Deceptive  2 

OO7LX5O0.021 

OQ7LX5O0.031 

OO7MN2Y0.011 

OO7MN2Y0.021 

OO7MN2Y0.031 

QO7TC5UF.011 

OO7TC5UF.021 

OQ7TC5UF.031 

OO7TOVER.011 

OQ7TOVER.021 

OQ7TOVER.031 

OQ7TVADC.011 

OO7TVADC.021 

OO7TVADC.031 

OO7U2T4R.011 

OO7U2T4R.021 

OO7U2T4R.031 

OQ7YP7OU.011 

OO7YP7OU.021 

OQ7YP7OU.031 

0O7YZOJ3.011 

OQ7YZOJ3.021 

OO7YZOJ3.031 

008_odpt.oii 

008_ODPT.021 

OO8_0DPT.031 

008_ODPT.041 

OQ8_2UO9.011 

008_2U09.021 

OQ8_2UO9.031 

OO800IG6.011 

OO800IG6.021 

OO800IG6.031 

0082OIU9.011 

00820IU9.021 

OO82OIU9.031 

OQ82SUTX.011 

OO82SUTX.021 

OO82SUTX.031 

00860ZNU.011 

OO860ZNU.021 

OO860ZNU.031 

0089U_ZR.011 

0089U_ZR.021 

OO89U_ZR031 

OO8ATU26.011 

OO8ATU26.021 

OO8ATU26.031 

OO8FGMVI.011 

OO8FGMV1.021 


Deceptive  3 _ 

OO8RAJ0C.011 

OO8RAJ0C.021 

OO8RAJ0C.031 

009EUKVT.011 

OO9EUKVT.021 

009EUKVT.031 

009100X0.021 

0Q9IOOXO.041 

009SOW8L.011 

0O9SOW8L.021 

009SOW8L.031 

OO9SOIK9.011 

OO9SOIK9.021 

OO9S0k:9.031 

OO9W0B9F.011 

OO9W0B9F.031 

OO9W0B9F.041 

OQ9U4FMU.011 

009U4FMU.021 

OO9U4FMU.031 

009Y_SVF.011 

009Y_SVF.021 

009Y_SVF.031 

OO9YH3OF.011 

OO9YH3OF.021 

009YH30F.031 

OOA2TT4C.011 

OOA2TT4C.021 

OOA2TT4C.031 

OOA3HIRX.011 

OOA3HIRX.021 

OOA3HIRX.031 

OOA32UTF.011 

OOA32UTF.021 

OOA32UTF.031 

OOA6U_IF.011 

OOA6U_IF.031 

OOA6U_IF.041 

OOAM4E3L.011 

OQAM4E3L.021 

OOAM4E3L.031 

00ARF2_X.011 

OOARF2_X.021 

OQARF2_X.031 

OOAWA38X.011 

OOAWA38X.021 

OOAWA38X.031 

OOAYXZGU.011 

OOAYXZGU.021 

OOAYXZGU.031 


Fig.42:  List  of  polygraph  files  used  in  this  experiment 
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6.3.  USER  INTERFACE 


For  an  automated  polygraph  system  as  a  real  product,  the  existence  of  an  user-friendly 
interface  is  unavoidable.  MATLAB  software  environment  provide  an  easy-to-use  toolbox 
for  creating  various  kinds  of  interactive  interface  classes.  The  following  figure  shows  an 
interface  used  in  one  of  my  representations.  This  was  made  for  a  technically  oriented  user 
who  is  familiar  with  the  algorithm.  A  simpler  black-box  version  of  a  polygraph  system, 
appropriate  to  the  user's  requests,  can  likewise  be  programmed. 


Fig.43:  An  example  for  a  technical  user  interface 
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6.4.  PROGRAM  LISTINGS 

(Implementation  in  MATLAB) 


%  THIS  PROGRAM  CALCULATESTHE  CLUSTER  CENTERS  FOR 
%  A  MULTIDIMENSIONAL  FCM  -  C«=2,  CONST. 

function  V  •  c_center<X,  U,  m) 

[colE,rowE]  =  size(X); 
k=lTOwE; 

Hfortiie  1ft  class: 

Vl_numerator»=U(lJc)An  •  X(:Jcy; 

%  (  •^>(»):  because  ftc  ’numerator  sum"  is  automatically 
%  included  within  the  matrix  multiplication. 

V(l,:)  “  Vl^numerator  f  sum(  U(1  ); 

%  V(l,:)  (and  Vl_numeratorl  is  a  n-dimensional  row-vector. 

%  n  represents  ftc  number  of  the  clustering  fcaturcs(n=30). 

%fcM^fte2nddass: 

V2_numerator«U(2>:).^  *  XfiJcX; 

•/•  (.•^>(*): .  .sec  above. 

V(2,;) = V2_numerator  /  sum(  U(2.k).^  ); 

%  tHk  is  a  n-dimensional  row-vector  and  fte  dustcr-centci 
%offte  2nd  class. 


V^V;  H  [nxc]  matrix 

return; 


%  FUZZY  C-MEANS  ALGORITHM  FOR  MULTl-DIMEN SIGNAL  FCM. 

•/ofimetion  bcst_Uik  »  fc_means(m,  cpsilon.X) 
function  tbest_Uik,  z]  *  fc_means(m,  epsilon.X) 

%fimction  best_Uik  =  fc_means(m,  epsflon) 

•/ofunction  [best_Uik,  V.  X]  =  fc^means(m,  epsilon) 

%  ftink  about  fte  X 

load  init_u; 


%  load  init_v. 


%loadset31; 

%X=featmat; 

%load  set3me;X=Xselect; 

^format  long; 

J_m=  100000000; 
z=0; 

while  J_m  >  epsilon 

V  =  c_centeT(X,  U,  m); 


%  start  wift  fte  initialization  of  fte  roemb^fet 
%(Uik=>Vi) 

%  or  with  the  cluster  centers 
•/oCV^i=>Wk) 

%  including  the  data  X  respect  XI ,  X2,  ... 


%  avoid  errors  by  visual  comparing  the  numbers 
%  to  make  sure  fte  start  is  o  Jc. 


U  =  roemb_fctpC,  V,  m); 

Jtemp  »  J_m, 

J_m  =  j_rodim(X  V,  U,  m); 

if  q»ilon  <*=  0.000005 

if  (abs(J_m  •  Jtemp)  <=  .00000000001). 

%ifj  m*“  Jtemp,  %  to  terminate  ftc  loop  by  reaching 

H  the  minimum  of  J_m. 

break, 

end 

else 

if  (abs(J_m  -  Jtemp)  <“  .0001).%--oJt 
%ifj  m  — Jtemp,  %  to  terminate  the  loop  by  reaching 

%  the  minimum  of  J_m. 

break, 

end 

end 


S  t»abs(U-tcmp);  Htdetance  value  for  the  iteration 

z-z+1; 

ifrem(z,10)»H) 

fyrintfC.VnOi 

else 

fyrinft;’.  0; 
end 
end 

^)Tintfi;V');fyn^t^r^ _ 


%Vnew-V; 

%  recall  the  extrem  values:  J_ni  =7.2308e+003 


return; 


%THIS  PROGRAM  CALCULATES  THE  OBJECTIVE  FUNCTION 


%  FOR  THE  MULTIDIMENSIONAL  FCM. 

fimction  J_in  *  j_mdiin(X,  V,  U»  m) 

(colE^rowE]  =  sizeOO; 
k-lJOwE; 

%for  the  1  class: 

VlasMatiix  =  V(:,l)*ones(l  jowE); 

tempi  *  ( X(:Jc)  -  VlasMatiix  y  •  ( X(:3c)  •  VlasMatrix ); 

templl-(  (U(l.:).^)  *  (diag(templ)')  ); 

J_outl  =  suro(templ  1); 


%  to  avoid  time-crunching  for-loops 

%  trick:  matrix-operation  is  faster, the  souj^t  norm  is 
%  automatically  the  diagwial  of  tempi ; 


%  to  avoid  time-cninching  for-loops 
%  see  above 

J_out2  =  sum(temp22); 

J_m  =  J_outl  +  J_out2; 
return; 


%for  the  2nd  class: 

VlasMatiix  =  V(:,2)"ones(l,rowE); 

tempi  =  ( -  VlasMatrix )'  •  ( X(:3c)  -  V2asMatrix  ); 

temp22  =  (  (U(2.:).^)  *  (diag(temp2)’)  ); 


%  THIS  PROGRAM  CALCULATES  THE  MEMBERSHIP  VALUES  FOR 

%THE  MULTIDIMENSIONAL  FCM. 

fimction  U = memb_fct(X,  V,  m) 

[colE,rowEl  =  size(X); 

k=  1  jowE; 

%for  the  1th  class: 

VlasMatrix  »  V(:.l)*ones(l,rowE); 

%  to  avoid  time-crunching  for-loops 

tempi  =  ( X(:Jk)  -  VlasMatrix )'  •  ( XOJc)  -  VlasMatrix ); 

%  trick:  matrix-operation  is  faster;the  sought  norm  is 
%  automatically  tire  diagonal  of  tempi; 

U_num(l  Jc)  =  ( diag(templ)' )  .'^  ( -l/(m-l) ); 

%for  the  2nd  class: 

Y2asMatrix  =  V(:,2)*ones(l  ,rowE); 

%  to  avoid  time-crunching  for-loops 

tempi  =  ( X(:,k)  -  VlasMatrix  )’  *  (  X(: Jc)  -  VlasMatrix ); 

•/•  see  above 

U_num(l  Jc)  *  ( diag(temp2)' -l/Cm-l)  ); 


U(I,:)-U  num(ljc)  J  (  U^num(l Jc)  +  U_num(2Jc)  ); 
U(2.:)  =  U_num(2Jc)  J  (  U_num(ljc)  +  U_numak)  ); 

%  If  Acre  is  a  third  dass,  *  U_num(3Jc) ... " 

%  must  be  also  considered. 


return; 

%  FAST  MULTIDIMENSIONAL  EVALUATING  PROGRAM 
clear  best^UQc; 

% - ^without  plots 

bcst^Uik  »  fc_means(5, 0.0000005,  Xselect); 


figure(l);clgJiold  on; 

8S-1:100; 

ldot(ss,best_Uik(l,:);+')y?lot(ss,best_Uac(2,:);-b'); 

%plot(s5,best_lHki3,:V*b') 

pause; 
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wrong_dq)S  =  0; 
wrong_nons  »*  0; 
figiire(2),clg;hold  on; 
fors=l;100 

ifbest  Uik(2,s)>=.5 
plot(s.best  Uik(2^);*b'); 
ifs>50 

wrong_dcps=wTong_dcps+ 1 ; 
end 
else 

plot(s,best  Uik(2^);+'); 

ifs<=50 

wrong_nons==wrong_nons+l ; 
end 
end 
end 

wpercent  «=  wiong_dq)s/50*  100; 

%Q)rintfCwrong_dcps,  percent^ 

%[wrong_dq>s,  wpercent] 
npercent  *  wrong_nons/50*  1 00; 

%fyrintfOTong_nons,  i^jcrcenO 
%(wrong_nons,  jq)eTCcnt] 

nn=<100*npercent); 
ww=(l  00- wpercent); 

fyrintfCVnO'.fyrintfCRJGHT  DETECTIONS:'); 
fyrintfOn^;^)rintfOn');§)tm  D^clustO; 

[nn  ww], 


%  USER  INTERFACE 

%  Program  B1 .  This  program  creates  the  start  button. 

figure(l);clg; 
set(gcCcoloi',[l  0  1]) 

button!  “  uicontrol(gcfi... 

'style'.'push',... 

•position',[195  150  75  75),... 

'string'.'START,... 

'callback', 'bt_choic'); 


•/.USER  INTERFACE 

^Program  B2.  This  program  displays  choices  to  nin  the  various  programs. 


clf  reset 

set(gcCcoloi*,(0  0  Ip 

titleCONE-DIMENSIONAL  MULTI-DIMENSIONAL') 

axis  off 

fim2  =  uicontroKgcf^... 

'style'.text',... 

'positionM25  40  155  200]); 

tt2  “  uicontroKgcf;... 

'style’, ‘text’,... 

'stiing’.TEATURE  ELIMINATlOhr,... 

•position’,[2521515540P; 

firm4  *  uicontrol(^C... 

’style’, 'frame',... 

‘position',I25  270  155  70P; 


tt4  *  uicontroI(^fi... 

'style', texf,... 

’string','FUZZY  C  MEANS  WITH  EVALUATION’,... 
^tion’,t35288  12545P; 


buttons  *  uicontroICgcf;... 
'style’.’push',... 

•position'.[38  275  125  25),... 
'5timg','INmAL  TEST,... 
'callback', *m^_tsf); 

frm  »  uicontroI(gcC... 
'style’,'frame',... 
•position',[20540  95185)); 


tt*  uicontroICgcf;... 
'styie'.'text',,.. 


'stiing'.'POLYGRAPH  DATA',... 
•position’, [207  165  85  40]); 

buttonl3  =  uicontrol(gcf;... 
'stylc'.'push',... 

•position’,[210  75  80  25],... 
'string','DATA  3*,... 
'callback'.load  fb<3y, 

buttonl  4  *  mcontroK^cC... 
'styic',*pushV.. 

■position‘,[2l0  10580  25],... 
'string', TDATA  2*,... 

'callback', load  fbc2'); 

buttonl  5  =  uicontrol(gcC... 
'style'.'push',.,. 

•position',(210  135  80  25],... 
'string', 'DATA  1',... 
'caHback'.’load  ftxl'); 


buttonl  6 = uicontrol(gcC... 
'8t>de','push',... 

^tion',[210  45  80  25],... 

'string'/CLEAR',... 

'caDback'.'clear'); 

buttonl  7  =  uicontroKgcC... 
'style'/push',... 

'posiliori,[45  200  125  25],... 
'string*, 'BOTH  >60%',... 
'callback'.'megaj'); 

buttonl  8  *  uicontroK^cC  ... 
’st>ie’,'push',... 

'p«itiori,[45  150  125  25],... 
’string’,’>80%  AND  >50%’,... 
'callback’,'mega_ii'); 

buttonlP  =  uicontroKgcC... 
'st)de','push‘,... 

■position’,[45  100  125  25],... 
’string','>50%  AND  >80%',... 
'callback’,'mega_iii'); 


button20  =  uicontrol(gcf;... 
'style', 'push',... 

•position’,[45  50  125  251,... 
'string'.’ONE  >98%',... 
’callback’,'niega_i\0, 

fim3  =  uicontrol(gcC... 
'style'.'frame',... 
*positiori,[320  40  165  185]); 


tt3  =  uicontroKgcf;... 

’style',‘texf,... 

'string'.'SEARCH  FOR  BECT  COMBINATION’.... 
■position',[350  150  120  65]); 

button21  =uicontrol(gct... 

'style', ‘push’,... 

•position', [318  230  192  25],... 

'string'.'FEATURE  COMBINATION*.... 
'callback'.'initf^; 

ftin5  *  uicontr(rf(gcf;... 

’style’.'&amc',... 

•position',[318  260  140  85D; 


tt5  *  uicontroICgcC-.. 

'style', Icxf,... 

'string'.'FUZZY  C  MEANS  WITHOUT  EVALUATION’,... 
t)osition’,[332  275  115  65D; 

button4  «  uicontrolii^C  • 

’st>1e’,'push’,... 

■position',[325265  12525],... 

'string'.’ALGORITHM’.... 

'callback’,*fc_ineans'); 

button22  =  uicontrol(gcC... 

'style', 'push',... 

•positiori,[337  125  100  25],... 

'string', 'GENETIC',... 

'callback','genetic4); 

button23  *  uicontrol(gcf;... 

'style', •push',... 

•po«tion’,[337  95  100  25],... 


'stjing'.TlANDOM',. . . 

'callba^, ‘random’); 

button24  =  uicontro!(gcf;... 

'styie’.^push’,... 

'positiaiU3376514525],... 

'string’,TSEUDO-EXHAUSnVE',... 

’callbaci^,'feature4'); 


%  THIS  PROGRAM  COMPARES  RESULTS  BY  DIFFERENT  SET-UPS 
•/.  OF  THE  W.  AN  EXAMPLE: 

w_conip"^cros(l  ,669); 
n_COTq)=zcros(l  ,669); 

indoHl  3  5  15  17  19  22  293031  33363738  3940  50); 
selindex=l:17; 

w  compCnuIex)  *  se!w_j)crccn^sc!indcx)  -  wjpcrccnt^mdcx); 
nrcompCmdex)  =  seln j>ercent(sdindex)  -  n jxsrcentCmdcx); 

RindexF{70  141  155  177  197  200  202  21 1  214  216  235  449  450  453  458  462  600); 
selin(tejp=  18:34; 

w_coii^>(Rindex)  -  sclwjpercent(selindex)  •  w_pcrcent(Rindex); 
n_conip(Rmdex)  =  sclnj?crcent(sdmdex)  -  n jperccnKRindex); 


•/•for  linewis; 

newindices={4  12  18  52  68  82  176  395  451  459  460 ). 
w_co*np(newindices)  =  w_j)erccnt(newindices); 
n_conjp(newindices)  *  n_perccnt(newindices); 

m=ll  3  4  5  12  15  17  18  19  22  29  30  31  33  36  37  38  39  40  50  52  68  70  82  141  155  ... 
176  177  197  200  202  21 1  214  216  235  395  449  450  451  453  458  459  460  462  600); 


[m;in2w_percent;m2n_perccnr,w;w_compCm)’,n_compCm))' 


•/•  ANOTHER  EXAMPLE: 

w_co!np=2eros(l,669); 
n_comp=2eros(l  ,669); 

indeHJ  3  4  5  12  15  17  18  19  22  29  30  31  33  36  37  38  39  40  50  52  68 ... 
70  82  141  155  176  177  197  200  211  214  216  235  395  449  450  451); 
selindot=l:38; 

w_cotnp0ndex)  =  selw_perccnt(selmdex)  •  wjpcicen^mdex); 
n_coTnp0ndex)  seln jpercent(selindcx)  -  n_pcrcentCmdex); 

RindexH453  458  459  460  462  600); 
selmdejc=40:45; 

w_comp{Rindex)  »=  sdw_perccnt(sdmdex)  -  w_percent(Rmdex); 
n_conq)(Rindex)  =  seln_pcrcent(sclmdex)  -  n_p«cent(Rindex); 


%for  1  newy; 
ncwindices={452]; 

w_conip(ncwiindices)  =  w_percent(ncwindiccs); 
n_com^ncwindices)  =  n_percent(ncwindices); 

in={13  4  512  15  17  18  19  22  29  30  31  33  36  37  38  39  40  50  52  68... 

70  82  141  155  176  177  197  200211  214  216  235  395  449  450  451  452  ... 
453  458  459  460  462  600); 

[in;m2w_j)erccnt-,m2n_percent;w;w_comp0n)  ;n_comp0n))' 


•/•  THIS  PROGRAM  SELECT  AND  EVALUATE  FEATURE  GROUPS 
%  ACCORDING  TO  THE  THRESHOLD. 

dima»on=669; 


H); 

forg*l:dimension 

%  . . ATTENTION:  Change  parameters  for  nF=3... 

if(  (n_pcrcent(g)<*M0)  &  (w_pcrcent(^»40)  ) 

H+1; 

ggO)^; 

m2wrong_dcps(I>^vrong_dcps(g); 

in2w_pcrcent(l)==wj>crccnt(g); 


in2wjok0)=100-m2w_percent(l); 


in2wTong_nons(I)=wrong_nons(g); 

ra2n_pcrcent(l)=nj)ercent(g); 

in2n_ok(I)=l  00-m2n jpercentO); 

m2z(!>=2(g); 

(n_percen^g)<=25)  1  (w_pCTcent(g)<=25)  ) 

dse 

w(D-0; 

Old 

end 

end 

1 

§>rintfCm2ft_#,  in2wrong_dcps,  m2w_ok,  in2wrcmg_nons,  m2n  ok,  m2itcrations,  bests’); 
h=li; 

in2wrong_dcps(h) 

m2w_ok(h) 

m2wrong_nons^) 

in2n_ok(h) 

in2z(h) 

w(h)]’ 


•/•  THIS  PROGRAM  REPRESENTS  ONE  THE  RANDOM  SEARCH 
%  FOR  4-TUPLE  FEATURE  COMBINATIONS. 

indH); 

for  1=1:10000 

aaa  =  round(10*rand(l,4));  4*ft~A-size  ofno“14 

%  ifaaa(I)>=7,aaa(l)=aaa(l>5;end; 

%  if8aa(2)>=7,aaa(2)=aaa(2)-5;end; 

%  if  aaa(3)>=7,  aaa(3>=eaa(3)-5;end; 

ifaaa(l>==0,  aaa(I)=l  l;end; 
if  aaa(2)=0,  aaa(2>=12;end; 
ifaaaC3)=0, 8aa(3)=13;end; 
if  aaa(4>=0,  aaa(4)=14;end; 

wMe  (  (aaa(l)=aaa(2))  |  (aaa(l>=aaa(3))  |  (aaa(2)=^aaa(3)) ... 

I  (aaa(2)=aaa(4))  (  (aaa(l)=aaa(4)) ... 

I  (saa(3)=aaa(4))  ) 

aaa  =  roimd(10*rand(l,4)); 

%  ifaaa(I)>=7,aaa(l)=4aa(I)-5;end; 

%  ifaaa(2)>=7,aaa(2>=aaa(2>5;end; 

%  if  aaa(3)>=7,  aaa(3>=aaa(3>5;end; 

if  aaa(l>=0,  aaa(I)=l  1  ;end; 
if  aaa(2)=0,  aaa(2>=12;end; 
if  aaa(3>=0.  aaa(3)=13;end; 
if  aaa(4)==0,  aaa(4)=  1 4;end, 

end, 

i. 

indi, 

gaa, 

dear  Xselect; 

Xsclect=Xsel(aaa,:); 

initlast. 

S - ATTENTION:  LIMITATIONS - •  %if  * - 

%if(  ((nn>=80)  A  (ww>=80))  |  ((nn>=84) |(ww>=84))  ) 

if  (((nn>=81)&<ww>=81))  ]  ((nn>=86)&(ww>=79)))  % - &A.  4*ft  x3m5m2 

y.if  ( (im>=70)  &  (ww>=80)  )  % - 4*fi  xlm5 

mdf=mdi+I; 

tl_coinbin0ndi)  =  eaa(I); 
t2_combinCmdi)  =  aaa(2); 
tS^combmOndi)  =  aaa(3); 

•4_conibin0ndO  *  aaa(4);S - 4*ft 

n_combres0nd0  =  nn; 
w_combres(jndi)  *  ww, 

§^tfC»»»»»»>»»»»»»»»»»»^; 

size(al_coinbin) 

fyrintf|>»»»»»»»»»»»»»»»»»*); 

end 

end 

>=l3ndi; 

(tl_combin(j) 

»2_combin()) 


•3_combm(j) 

a4_combm(j) 

n_combres(j) 

w^combresO’)]' 


%  This  program  exhaustively  tests  all  possible  combinations  of 
%  size  eight  in  x3  from  the  number  of  features.  It  then  records  the  ones 
%  that  meet  the  if-then  criteria  below. 

%  cleaiCiniO  for  normal  initialization. 

loadx3 

featureHSI  H 1  450  452  197  459  30  ] 
n^ength(features) 

for 

XscKCl :  1 00)=x3(features(0,1 : 100); 
end 

if  existCiniO^l 

%  program  continuatioTL  No  need  to  initialize  otiier  variables. 
il“init(l); 
i2*init(2); 
i3=init(3); 
i4=init(4); 
i5=mit(^; 
i6=init(Q; 
i7=mit(‘^; 
i8=iiut(8); 
else 

%  initialize  all  variables. 

mdi=0; 

record^!]; 

il=l; 

i2*2; 

i3=3; 

i4=4; 

i5=5; 

!6=<S; 

i7*7; 

i8=^; 

end 


while  i!<=n-7 
wdiile  i2<=n-d 
while  i3<=n-5 
while  i4<=n-4 
while  i5<=7i-3 
while  i6<»n-2 
white  i7<=n-l 
«dule  i8<=n 

aaa==til  i2  i3  i4  i5  i6  i7 iSJ 
inch 

clear  Xselect; 
Xselect=Xsel(aaa, :); 

initiast; 


- ATTENTION:  LIMITATIONS - "  %if"* 

ave  =  (nn+wwy2; 


if(  ((nn>-=81>^ww>-81))  |  (ave>-83)  ) 
indHndi+1; 

record={record  ;  features(aaa)  nn  ww]; 
§mntf(’»>»»»»»»»»»»»»»»»»'); 
(features(aaa)  nn  ww] 
end 

i8=i8+l; 

end  %  end  is  loop 

i7=i7+l; 

iS«i7+l; 

end  %  end  i7  loop 

i6=i6+l; 

i7-i6+l; 

i8=i7+U 

end  %  end  i6  loop 

i5»>i5+U 

i6=i5^-l; 

i7^6fl; 

»=i7+U 

end  %  end  i5  loop 

i4Hi4+l; 

i5=ri4+l; 

i6=i5+I; 

17-46+1; 


i8==i7+l; 

end  %  end  i4  loop 

i3*i3+l; 

i4=i3+l; 

i5=i4+l; 

i6=i5+I; 

i7=i6+l; 

i8=i7+l; 

end  %  end  i3  loop 

i2=i2+l; 

i3=i2+l; 

i4=i3+I; 

i5=i4+l; 

i6=i5+l; 

i7*i6+l; 

i8=i7+l; 

end  %  end  i2  loop 

iim+l; 

i2=il+l; 

i3=i2+l; 

i4*i>+l; 

i5=i4+l; 

i6=i5+l; 

i7=i6+l; 

I8N7+1; 

end  %  end  il  loop 

record 


%  Genetic  algorithni  in  search  of  the  optimal  n*tuple 
%  from  a  gene  pool  of  features. 

%  This  version  records  the  actual  feature  numbers  in  the 
%  matrix  'record',  not  the  index!? 

%  x3.  Set  m  in  initfast. 

%  set  init=l  for  automatic  initialization 


coniment='x3,  m“5, 15-tupIe.' 

n=15; 

loadx3 

clear  Xselect; 

features=(9  1 1  30  50  39  81  235  358  359  363  449  197  29  450  453  457  458  478  ... 

Ill  452  482  361  15 3637 32  8 67 79 460] 

•/•features=I4  5  8  9  12  18  19  22  29  30  33  36  39  40  50  56  62  76  7981  ... 

%  1 1 1  1 14  163  197  235  358  359  361  363  403  449  450  452  453  456  457 ... 

•/.  458  477  478  482  534  625  ] 

feature_num=Iength(features) 

for  f=I;feature_num 

XselCCl :  1 00>=x3(features(0,l :  1 00); 

end 

clear  x3; 

clear  average_fitness; 
ifinit=“l 

%  initialize  population  size,  crossover  rate,  mutation  rate,  etc. 

population_size=200; 

mutation_rate=0.001 ; 

crossover_rate=0.7; 

record=zeros(20,n+3); 

indi=0; 

%  initialize  population 
landCuniform^; 

population=fix((featurc_num  -  .0000001)  .*  iand(population_size,n))  +  1; 
end 

%  start  evolution 

for  generation=I  rlOOOOO 

generation 

%  test  the  population  for  fitness 
for  f  *  1  :population_size 
Xselect  “  XseKpopulation(C:)v);*/« 

initfast;  %  test  each  individual 

fitness(f)  “  abs((nrt+wwy2  •  20);  subtract  20  to  exaggerate  the 

%  diflGcrencc  in  fitness  ratio 

%if(  ((nn>=70)&(ww>“70))  |  (filness>=56)  |  ((nn<*20)&(ww<=20)) ) 
if(  (fitness(0>=<5)  |  {(nn<'20)&(ww<-20))  ) 
indHndi+l 

recordCmdi,:)  -  [features(population(C:))  generation  nn  ww];% 
[features(population(t:))  generation  nn  ww] 
end 
end 


%  display  average  fitness  in  percentage 
•verage_fitness(^cration)=^ean(fitness)  +  20 


%  REPRODUCTION!! 

%  reorder  the  6tness  values  for  easier  computation 
fit_ineasure(  I  )>=fitiiess(  1 ) ; 
for  f=2TX)pulation_sizc 
fit_ineasure(f^=fit_ineasure(f- 1  )+fitness(f); 
end 


fOT  ^l:population_size 

%  randomly  pick  orw  individual  to  copy  into  the  new  population 
%  individuals  widi  higher  fitness  values  are  more  likely  to  survive 
tcmp=fit_ineasure(population_size)  .*  rand; 
inde3c*find(abs(fit_measure-temp)  *■*=  inin(abs(fit_measure-temp))); 
if  t^  <=  fit_mtasureCmdex(l)) 

new j>opulation(C:)=I>opulation^dex(l ),:);% 
else 


new_population(C:)=populationOndex(l )+ 1 


end 


end 

populationFnew_popuIation; 


%  CROSSOVER!! 

M; 

while  f<=popvdation_si2e 

if  rand  <=  crossover^ratc 
mate  =  f; 
crossover  =  0; 

while  (f  <  population_sizc)  &  (crossovei=0) 

M+1; 

if  rand  <=  crossover_iate 
•/•  actual  crossover 
crossover"  1; 

tcinp"fix((n  •  l.(X)001)  .*  rand)  +  2;*/# 
gene_temp=population(mate,temp:n),% 
population(mate,tempii>=population(Ctempjt);% 
population(ttempm)=gene_temp; 
end 


end 

f=f+l; 


%  MUTATION!! 

*/•  Note:  Modified  Aug.  19  due  to  a  bug 

num_mutation=T)opulation_si2e  *  mutation_iate  •  n  .*  (randn  +  1); 
for  fi=l  jium^mutation 

populatioT>(fix((popuIation_size-0.00000 1 ).*rand+ 1  ),fix((n-0.0000 1 )*rand+l)). .  . 

*  fix((feature_num  -  0.000001)  *  rand  +  1); 
end 

%  save  record  in  case  of  crashing 

save  crashrec  comment  record  average_fitness 

%  go  to  next  generation 
end 

%  display  record  of  good  individuals  in  history 

comment 

record 

%  (sort(record(l  iidi,  1  m)*)’  record(l  iidi^  1 3i+3)] 


%  SELECTION  AND  INITIALIZATION  OF  THE  DATA  CENTERS 
%  FOR  THE  LMS  FILTER. 

%  •initiain_sess"  »  Polygraph  sessions  which  are  used  for 
%  INITialization  of  the  "data_centcrs"  and  TRAINing. 

^Thc^initrain’sessiortstresetinawaythatthe  1st  part 
%  (before  the  •border")  represents  the  non-deeptive  and  the 
%  2nd  part  (after  tire  bwder)  the  deceptive  scssiwis. 


dear, 

%* 


%• 


%• 


To  be  set  for  each  polydat  J  (ftx3,  ftx2,  ftxl):  •••••••••••••••••• 

whidifeatures_3  =  [1:30]; 
iiondses»cms_3  =  [H -50]; 

%I1  68912  1618  2124  27  28  32  35  44  48]; 
depsessions  3*151:90}; 

%[51  53  58  59  63  67  72  75  82  85  88  89  93  95  lOOj; 

» 

wfaichfeatures_2  *  []; 
iKmdsessions_2  *  (J; 
dqKcssions_2  *  Hi 


whichfeatures_l  *  (J; 
nondsessions_l  *  [); 
dcpsessioirs_l  *  []; 


•/••  ATTENTION;  The  DIMENSION  of  each  ’’whichfeatures_...“  is  to  be  equal! 
•/c*  (or  aero) 

. . . * . ****** . 


if  Iengll^whichfeatures_3)  length(whichfeatures_2)  | ... 

Ici^th(whichfeatures_2)  -=  Iength(v^chfeatures_l) , 

%int^!!! !!!!!!!!!!!!!! 

ftnintfCCheck  "whichfeatuies"!  They  are  different  big!\n'); 

§)rintf^e  dimensions  are  as  foIIowing:\n'); 

IJptintf^*); 

^sdntfC  1st  2nd  3rd\n'); 

disp(  pength(whichfeatures_l),  length(^chfeatures_2), ... 
lengtfi(whichfeatures_3)l ) 

jljiintfCYOU  DO  NOT  NEED  TO  CHANGE  THE  EMPTY  ONESINnO; 
^tfriF  THATS  THE  CASE;  PRESS  ANY  KEY  TO  CONTINUE-Nn*); 
^tfC!l!l!!!!!M!!!!!!!!!!!!!!!!!l!!!!!!!n!!l!!!!!!\n’); 

pause; 


end; 

border = kngth(nondsessions_3)  +  length(nondsessions_2) ... 
+  lo^A(TX3ndsessions__l); 


WA%polydat_3: 

ifsizfi(nondsessions_3,I)  0, 

hMd  c:\useis\rarmn\fcnfi\muhidim\fbc3; 

diro  =  Iength(whichfeatures_3); 
f*l:dim; 

Nteinp_3(C:)  *  x3(whichfeatures_3(0,  nond$essions_3); 
Dtanp_3(f;:)  *  x3(whichfeatures_3(0,  dcpsessions_3); 

ckarx3; 

end; 

%%%polydat_2 

if  sizie(iKiodscsak)ns_2, 1)  0, 

load  c:\iisers\ramin\fcni\inultidini\fb<2; 

dim  *  lcngth(whichfeatures_2); 
f=I:dim; 

Ntciiip_2(i;:)  =  x2(whichfeatures_2(0,  nondsessions_2); 
Dtemp_2(f;;)  =  x2(whichfeatures_2(f),  dcpsessions_2); 

dearx2; 

end; 

%%%I>oIydat_l 

if  si2e<n(Hidsessions_l,l)  0, 

load  c:\uscrs\rarnin\fcm\rnultidim\ftxl; 

dim  *  lcngth(whichfeatures_!); 

^l:dim; 

Ntemp  1  (C:)  =  xl  (whichfeaturcs_l(0,  nondsessions_l); 
Dtemp jcC:)  *  xl(whichfeatures_l®,  dcpsessions_l); 

ckarxl; 

end; 

mitrain^sess  •  pNltemp_3‘;  Ntemp_2';  Ntemp_r; ... 

Dtemp_3';  Dtcrop_2';  Dtcmp^l'J; 

howmany  »  siaeCinrtrain_sess,l); 


xned](initrain_$ess); 


%  TWO  FEATTJRES  AT  A  TIME  -  PLOT  EXAMPLE: 
%plolCinitrain^sess(1 :40.1),inilrain_scss(l  :40,4),'.y) 
Hholdon 

•/^)lol0nitrain_sess(41 :80,lXinitrain^sess(41 :80,4);  jO 


%  SELECTION  AND  INITIALIZATION  FOR  LMS  FILTER. 

%  31k  "initram"  data  represents  Polygraph  sessions  which  are  used  for 
%  INTTialization  and  TRAINing  of  the  "data^centers"  and  irqjut  data. 


%  The  "initrain"  data  arc  set  in  a  way  that  the  1st  part  •  before  the 
%  "(TC_)border"  -  represents  the  non-deqitivc  and  die  second  part 
%  -  after  the  "(JCJborder"  •  the  deceptive  sessions. 

%The  prefix  "nond"  represents  the  non_decptive,  and  "dep*  die  deceptive 
%  elements. 

clear, 

. . 


TO  BE  SET  FOR  EACH  polydatj  (ftx3,  ftx2,  ftxl): 

%*  First  for  the  data_centers: 

nondsessions_3  ■*  [1:20]; 

•/•[I  6  8  9  12  16  18  21  24  2728  32  3544  48]; 


depsessions  3 -[51:70]; 

"  %I51  53  58  59  63  67  72  75  82  85  88  89  93  95  100); 

» 


%• 

» 

nondscssions_2  *  []; 

dcpscssions_2  *•  [J; 

ft 

nondsessions^l  ■=  []; 

dcpsessions^l  =  []; 

ft 

ft 

%*  Now  for  the  mput  data  for  which  the  filter  is  to  be  * 

•/••  to  (C)lassify: 

%• 

TC_nondsessions_3  =  [  1 :30]; 

TC_dcpsessions_3  =  [5 1  ;80];  * 

TC_nondsessions_2  *=  []; 

TC  dcpsessions_2  =  []; 

TC_nondsessions_l  =  []; 

TC_dcpsessions_^l  *  []; 

%•  • 

%•  And  finally  for  the  selected  features: 

%•  • 

whichfeatures_3  =  [1:30]; 
whichfeatures_2  =  [J; 
whichfeatuies^l  =  []; 

%• 

•/#•  ATTENTION:  The  DIMENSION  of  each  *'whichfeatures_..."  is  to  be  equal!  • 

%•  (or  zero) 

. . . . 

if  length(w1iichfeatures_3)  ~=  length(whichfeatures_2)  | ... 
length(whichfeatures_2)  ~=  length(whichfeatures_l) , 

^nintfT! !!!!!!!!!!!!!! 

fyrintfTCheck  "whichfeatures"!  They  are  dififerent  big!\n'); 
fyrintf^e  dimensions  are  as  following:\n'); 
fjmntf^'); 

^rintlC  1st  2nd  3rd\n’); 

disp(  (length(whichfeatures_I),  length(whichfeature$_2), ... 

length(wduchieatures_3)] ) 

fyrintf^*); 

fyrintfCYOU  DO  NOT  NEED  TO  CHANGE  THE  EMPTY  ONES!\n'); 
fyiintfCIF  THArS  THE  CASE:  PRESS  ANY  KEY  TO  CONTINUE. Nn); 
fyrintfC!!  I!!!!!!?!!!!!!!!!!  !!!!•!!  !l!!l!!!l!!!!!!!!!!!\n'); 
pause; 


end; 

border  =  length(nondscssions_3)  +  lengd»(nondsessions__2) ... 

+  length(nondsessions_l); 

TC_border  *  lcngthCrC_nondsessions_3)  +  lengthCTC^nondsessions J) ... 
+  lengt^C_ivondsession$_l); 


•/••/*%  poIydaO: 

dim  « length(^chfcatiires__3); 
ifdim-— 0, 


load  c:\iiscrs\iarnin\fcm\multidim\ftx3; 
f*l:dim; 

if  length(TC_nondsessions_3)  ~=  0, 

TC_Ntemp3(C0  *  x3(whichfeatures_X0.  TC_nondsessions_3); 
end; 

if  length(TC_dcpsessions_3)  0, 

TC_Dtemp^3(C:)  =  x3(whichfeatures^3(0,  TC_dcpscssions_3); 
end; 


if  length(nondsessions_3)  0, 

Ntenip_3(C'.)  *  x3(whichfeatures_X0,  ncmdsessions_3); 
end; 

if  length(dcpsessions_3)  0, 

Dtemp_XiiO  *  x3(wduchfeatures_3(0,  dq)sessions_3); 
end; 

clear  x3; 

end; 


%%%  polydat_2 

dim  *  fengtii(wduchfeatures_2); 
ifdim-— 0, 


load  c:\users\ramin\fcm\raultidim\ftx2; 

^l:dim; 

if  length(TC_nondsessions_2)  0, 

TC_Ntemp_2(C:)  “  x2(%duchfeatures_2(0,  TC_nondsessions_2); 
end; 

if  lcngth(TC_dq>sessions_2)  0, 

TC_Dtemp_2(C:)  *  x2(whichfeatures_2(0,  TC_dq>sessions_2); 
end; 

if  lcngth(nondsessions_2)  ~=  0, 

Ntemp_2(C:)  “  x2(whichfeatures_2(0,  nondsessions_2); 
end; 

if  length(dcpsessions_2)  -=  0, 

Dtemp_2(C:)  •=  x2(whichfeatures_2(0,  dcpscssions_2); 
end; 

dear  x2; 

end; 

%%%  poIydat_! 

dim  =  length(wduchfeatiires_l); 
if  dim  -=0, 


load  c:\users\ramin\fcm\multidim\ftxl; 
f=l:dim; 

if  lengthCrC_nondsessions_l)  -=  0, 

TC_Ntemp_l(C:)  =  xl(whichfeatures_l(f),  TC_nondsessions_l); 
end; 

if  length(TC_dcpsessions_l)  0, 

TC_Dtemp_i(C:)  »  xl(wduchfeatures_l(f),  TC_dcpsessions_l); 
end; 

if  length(nondsessions_l)  0, 

Ntemp_l(C:)  =  xl(whichfeatures_l(0,  nondsessions^l); 
end; 

if  length(dcpsessions_l)  0, 

Dtemp_l(C’.)  *  xl(whichfeaturcs_l(f),  dq»cssions_l); 
end; 

dearxt; 

end; 

TC  initrain  « [TC_Ntemp_3';  TC_Ntemp_2';  TC_Ntcmp_r; ... 

~TC_Dtemp_3';  TC_Dtomp_2';  TC_Dtemp_rj; 

ccnt_initram*=  [Ntemp_3';  Ntemp_2’;  Ntcmp_r;  ... 

Dtemp_3';  Dtemp_2';  Dtemp_r]; 


%  LMS  FUZZY  ADAPTIVE  FILTER. 

fimctkm  [ncw_theta,  new_^data_ccnteTs,  new^sigma,  ou^_label]  - ... 
»<^pt2zy(theta,  data_cento,  sigma,  in|>ut_vcct,  desire,  step) 
HfyrintfCsize(11ieta):*);si2e(theta), 

%^?rintfCsize(sigma):^;si2e(sigma), 


%  Get  the  dimensions  of  matrices  and  verify  their  consistency, 
[label^no,  ft_noJ «  si2e(data_centers); 

if (nabe!_no,  ft_no]  size(sigma))  1^1,  ft_no]  -=«izeCiivut_vect))  }... 
dlabel^no,  1  ]  size(theta)) 


end; 


enoiCmatrix  dimensions  are  wrong!  *) 


%  Evaluate  Gaussian  membershipfimctions: 


distances  ^  (ones(labeI_no,l)  •  input_vect)  *  data_ccnters; 
%fyrintfCsize(<listances):');si^distances), 

%  To  creat  compatible  dimensions:  Fill  input_vect  down  into  an 
Vo  (label_no  x  ft_no)  matrix,  so  that  it  is  the  same  input  for  all 
%  Oabel_no)  rules,  and  then  subtract  data_ccnter$  from  h. 

a  -  cxp(  -0.5  •  $um(  ( (distances  J  sigma)  .^2  )'  y  ); 

%  Widiout  "sum":  a  *  Uik  i.e.  membership  values 
Vo  ctc.etc... (conventional  way) 

%+++ 

%fyrintfCsize(a):');*ize(a), 

Vo  Centroida!  defuzzification; 
b  =  sum(a);%§>rintfCsize(b);'),size(b), 
ou^_labcl  *  sum(thcta  .*  a)  /  b; 

54+++ 


Vo  Adaption: 

tempi  =  step  .*  (desire  •  output_labeI)  .*  a  Tb; 
new_theta  =  tiieta  +  temp  1 ; 

tcmp2  =  ( (tempi  •  (frteta  -  output^label) )  *  ones(!,  ft_no))  .•  ... 

distances  J  (sigma  .'^2), 
new_data_centers  =  data_ccnters  +  temp2; 

new^sigma  *  sigma  +  temp2  ,*  distances  J  sigma; 

Vo^ 


return; 


•/•  LMS  RLTER INTIALIZATION  (TRAINING  AND  TESTING) 
•/.FIRST  VERSION 

•/.  clear  eveiylhing! 
clear, 

%loading ...: 

load  c:\users\rainin\fcm\multidim\ftx3; 
whichjeatures  =1:1 00,54 - to  change!  I ! 


Vo  die  data  from  the  'person*  w1\o  is  to  be  tested: 
pcrson  =  2; 

tes4)erson  =  x3(wduch_features, person)'; 


polysession(I,:)  =  x3(which_features,l)’,*/4nondecp 
•/.•/.•/.ix3(81,l),  x3(l  11,1),  x3(235,l),  x3(450,l),  x3(452,I)]; 

polyscssion(l  6,:)  =  x3(wiiich_features,  100)’;%decp 
•/.•/.•/.[x3(81,100),  x3(l  1 1,100),  x3(235,100),  x3(450,100),... 
•/.•/4*/<»x3(452, 1 00)J;  54  polygraph  data  for  two  sessions, 

54  i.e  one  truthful  &  one  deepetive 

poly5ession(2,;)  =  x3(which_features,48)';54nondecp 
polysessionO.:)  “  x3(wduch_features,5)';54nondecp 
polyscssion(4,:)  *  x3(which_features,8)';54nondecp 
polysession(5,:)  *=  x3(which_features,9)';54nondecp 
polysession(6,:)  *  x3(wduch_features,12)’;54nondecp 
polyscssion(7,:)  *  x3(which_features,lQ';54nondecp 
polysessionjs,:) = x3(which_fi»tures.  18y;*/4nondecp 
polysession(9,:)  =  x3(which_fcatures,2iy;*/4nondecp 
po!ysession(10,:)  «  x3(wduch_featurcs,24)';54nond«p 
p<^ysession(l  1,:)  *  x3(which_fcatures,27)';54nondecp 
polyscssion(l  2,:)  »  x3(which_features,28)';54nondecp 
polysession(l  3,;)  “  xXwhich_£eatures,32)';54nondecp 
polysession(14,:)  *  x3(»1uch_features,35)';54nondecp 
po!ysession(15,:)  *  x3(wduch_features,44)';54nondecp 

polyscssion(l  7.;)  »  x3(which_features,95)';5'.decp 
polysessionOS,:)  *=  x3(which_features,93)’;54decp 
polysession(l  9,:)  =  x3(^ch_features,89)';54decp 
po!ysession(20,:)  =  xXwhich_featurcs,88)';54decp 
polyscssion(2 1 ,:)  •  x3(which_features,85)';54decp 
pol>«ssion(22,:)  =  x3(which_features,82)';54decp 
polyscssion(23,:) = x3(which_features,7^;*/.decp 
poly5Cssion(24,:)  *  x3(^iduch_features,72)';54decp 
pol>^cssion(25,:)  »  x3(5sduch_features,67l';54decp 
polyscssion(26,:)  =  x3(which_features,63)';54decp 
polyscssion(27,:)  *  x3(^ch_features,59)';*/4decp 
pdysession(28,:)  *  x3(which_features,58)';*/odecp 


pol>«ession(29,:)  =  x3(which_features,53)’;%decp 
polysessjcm(30,:) = x3(wiuch_features,5l)';%dcq) 
piowmany,  dim]  -  size(polysession),%  "howmany"  mxist  be  even! 
half**  howmany/2; 

dear  x3;  %save  memory  &  clear 
%+++ 

HmitxaHation  &  clear 

$tep*  0.005; 
output -zeros(  1,2) 


oulputjncan  **  n.  2r 

input_mean  **  polyscssion; 
inpat_width  =  1  *  ones<howmany,  dim); 


%  Testing(see  100  for  des) 


(dummy,  dummy,  dummy,  output]  = ... 

•dapt2zy(output_mean,  iTq)Ut_mean,  ij^ut_width,  testperson,... 

100,  step); 

%  Test  how  good  the  ouq>ut  is  at 
•/•  the  beginning. 


end, 

output 

pause; 

figure(l);clg 

ploW^ou^ut,'.*); 

•/•irfot(ou^ut_mean,’.b'); 

hoVdon; 

•/«inesh0nput_width); 


•/•  SEE  ABOVE  -  SECOND  VERSION. 

%Usei  interlace  to  improve! 

SmmALIZATION: 

%  It . .  «  M 

step  *  0.5;  */•  Learning  factor 

%  The  jmcEx  *TC"  represents  the  input  data  for  which  the  filter 
is  to  be  COrained  to  (C)lassify: 

TC^howmany  =  sizc(TC_initrain,  1); 

(howmany,  dim]  =  si2e(cent_initrain);  54  representing  data_centers 
clear  output; 

output  *  ze!Os([TC_howmany.  1  ]); 

%  "+!•  represents  the  nondeceptive  and  "♦!"  the  deceptive  data: 
init_theta_non  *=  +1  *  ones(border,  1); 
init_theta_dcp  =  -1  *  (mes((howmany-border),  1); 

output^mean  *=  [iiut_dieta_non;  init_theta_dcp];  %  —  data_centers 

ir^nit_mean  =  cent^xnitrain; 
input_width  «  100  *  ones<howmany,  dim); 

. . . 


%  Before  any  training ... 

H  Test  bow  good  die  output  is  at  die  b^inning: 

for  fc=l  :TC_howmany 

ifk<«TC_border 

des=»+-l; 

dse 

des*=-l; 

end 

(dummy,  dummy,  dummy,  outputOc)]  ■=  ... 
adapt2^output_mean,  ii^ut__mean, ... 
input^width,  TC_initrainC^:),... 
des,  step); 

end, 

clear  dummy  , 
output. 


figure(l);clg 

plo^ou^C+Oi 


%plot(output_mean,’*’b'), 

hold  on; 

pause; 

%mesh(input_width); 

%  Starting  training;  DO  A  BETTER  USERINTERFACE! 

for  1*1:30 

forj*l:5 

for  fc=l  :TC_howmany 

ifk<*TCJ)order 

des=+'l; 

else 

des=-l; 

end 

[output_inean,  mput_mean,  input_width,  output^k)]  ... 
adaptzzy<output_mean,  input_mean,  input_width, ... 
TC_initrain(k,:),  des,  step); 

end, 

end, 

output, 

6gure(l); 

plot(outpuV.');  Voaxisdl  100-0.22.1]); 

%plot(output_niean,'*b'); 

%mcshCmput_width); 

%pause; 


end; 


% . .  SAVING  THE  FILTER  CHARACTERISTICS: 

Q)rintfr!!n!!!!!!!!Mn}n!!!!!!!!!!!n!l!!!!n!!nnn!!!!?!!\n'); 

fyrintfClF  YOU  WANT  TO  SAVE  THE  CHARACRERISTICS  OF  THIS  FILTER,\n'); 
fyrintfTPLEASE  TYPE  ANY  NUMBER(#0  FROM  l-99!\n'); 
fyrintfTTHIS  FILTER  WILL  BE  THEN  SAVED  AS  'filter#-  INn*); 

clear  numb; 

numb  =  input(The  filter  numbeit^#)  is;*); 

%  By  default:  numb=(],  i.c.  nothing  wffl  be  saved. 

if  numb 

numb  =  int2str(numb); 
com  ={'save 'filter',  numb, ... 

'  whichfcatures_3', ... 

'  whichfeatures_2', ... 

*  whichfeatures^r, ... 

'  output_inean', '  output_mcan', ... 

'  input_inean’, '  input^width’J; 

evaI(com); 

end; 


%  CREATING  THE  ELLIPTICAL  CLUSTERS  FOR  THE  VISUAL 
%  INSPECTIONS  -  AND  ALSO  FOR  STTINGTHE  RULES. 

function  [x,y]=ellipse(xcenter,yccnter,x^’idth,ywidth) 

angle=(0;0.02*pi:2-pii; 

x=xwidth  .•  cos(an^e)  +  xcenter, 

y®ywidth  .*  sin(angle)  +  ycenter, 

plot(x,y;-') 


y.  TEMPORARY  LMS  SETTING  -  TEST 

fimction  ou^ut_labeWuzztemp0nput_vect) 
theta=(  1  1  -1  -iT; 

data  centeis^-l  -0.5 ;0.0.25 ;0 0 ;  1  03); 
sigrnH  0  5  0.8 ;  0.5  0.25 ;  0. 1  0.2  ;  0.6  0.5 ); 


%  Get  the  dimensions  of  matrices  and  verify  their  consistency: 
[labcl_no,  ft  no]  *  size(data_centers); 

ifCPabcI  no7ft  no]  —  $ize(sigma))  1  (Jl.ft^no]— sizeCmput^vcct))  |... 
(Pabdlno,  if—  size(theta)) 


OTorCmatrix  dimensions  are  wrong! ) 

end; 

%+++ 


54  Evaluate  Gaussian  membersh4>functiOTs: 

distances  *  (ones(label_no,l)  *  input^vect)  •  data_ccnter5; 

a  -  cxp(  -0.5  .•  sum(  ( (distances  7  s^a)  .''2  )'  )'  ); 


%  Ccntroidal  deflusification: 
b  »  sumCa); 

output_label  -  suni(theta  .*  a)  /  b; 
output_label  =  output_lflbeI  2; 

return; 


%LMS  FILTER  TESTING. 

%  Experimenting  with  the  use  of  adaptive  fuzzy  logic 
%  in  polygraph  classification. 

init*rirput(Do  you  want  to  initialize  aB  parameters?  '.’s'); 
ifinit=*-y 

%  Initialize  the  parameteis  for  fiizzy  LMS  algorithm. 

•/•  Output  of  I  means  nondeceptive 
%  Output  of-1  means  deceptive 
%  length(output_mean)  *=  #  of  rules 
fyrintf('initia]Lang\n'); 
outpul_mean={  1  1  -1  -If; 

%  ii^ut^mearH  centers  of  first  nile ;  centers  of  second  rule ;  etc.  ], 
input  mean=[  -1  -0.5 ;  0  -0.1 ;  0  0 ;  1  0.3 ); 

•/•  in^t_width-(  widths  of  first  mle  ;  widths  of  second  nile  ;  etc.  ]; 
input^widtlH  0.5  1 .3 ;  0.5  0.25 ;  0.1 0.2 ;  0.6  0.5  ]; 


fcatures=(45 1  452);  %  Select  die  features 

5tep=0.005;  */•  Select  learning  rate 


%  Select  training  data 

ndcp_3=l:15; 

dcp_3='51:65; 

ndcp_2=ll; 

ndcp_Hl; 

dcp_l=[); 


%  Nondeceptive  sessions  in  x3  for  training 
%  Deceptive  sessions  in  x3  for  training 


%  Note  that  nondeceptive  data  in  xl ,  x2,  and  x3 
%  are  the  same,  so  ndcp_2  and  ndcp_l  are  really 
%  redundant. 


load  x3; 
load  x2; 
loadxl; 

Ntrain={xl(features.ndcp_l)  x2(features.ndcp_2)  x3(features.ndcp_3)] ; 
Dtram=txl(features,dcp_l)  x2(features,dep_2)  x3(features,dcp^3))’; 


%  Select  testing  data 

ndcp  3»{);  •/•Nondeceptive  sessions  in  x3  for  testing 

dcp_3=66:100; 

ndcp_2=[]; 

dcp_2-[5l:l00]; 

ndcp_l=16:50;  ^  , 

dcp_H51:100];  %  Note  that  nondeceptive  data  m  xl,  x2,  and  x3 

•/•  are  the  same,  so  ndcp_2  and  ndcp_I  are  really 
%  redundant. 


Ntest={xKfeaturesjidcp_l)  x2(features,ndcp_2)  x3(features,ndcp_3)r; 
Dtcst=xi(features,dcp_l)'; 

Dtest2=x2(features,dcp_2X; 

Dtest3=x3(features,dcp_3)’; 

clear  xl; 

clear  x2; 

clear  x3; 

clear  record; 

epoclvO; 

end 


%  Test  fuzzy  system  before  any  training 
%  Test  training  data  first 
clear  Nouiput. 
clear  Doutput; 

[Ntr,diimmy)=^ize(Ntrain);  %  Ntr  -  total  #  of  nondeceptive  sessions 
(Dtr!dummy)=size(Dtram);  *4  Dtr  -  total  #  of  deceptive  sessions 

enorCNumbcr  of  nondeceptive  and  deceptive  training  data  mismatch'); 


end 

fori=l:Ntr  .  ^ 

(dummy, durTimy.durnmy.Noutput®}=adaptzzy(output_meaiynput_incan,.. 

input  width.Ntrain(i,:),l.step); 

Idurnmy,durnmy,(hmimy.Doutput(0}==adaptZ2y(output_mearwut_mean^ 

input_widtiiJDtrainCi.:X^  1  ,step)i 


end 

iecord(epoch+l.l  :2)H0engtii(find(Noutput>0)yNtr)  Ocngth(find(Dou^ut<0)yDtr) ); 
squared_erroi<epoch+ 1,1 :2Hntean((l-Noutput),^2)  mean((Doutput+ 1).''2)1; 
fyrintfi^’percent  correct  nondeceptive  and  deceptive  detections  for  training  dataiNn"); 
disp(record(epoch+ 1,1 :2)) 


%  Now  test  testing  data 
clear  Noutput, 
clear  Doutput; 

[Ntc,dummy)*sizc(Ntest);  %  Ntc  -  total  #  of  nondeceptive  sessions 


fori=l:Nte 


[dummy, duiraTiy.duimTjy,Noutput(OHdaptzzy(outputjneaivnput_niean,.^^ 

input_width,NtestCi,:),l 
end 

[Dte.dummyHizeCDtesO;  */•  Dtc  =  total  #  of  deceptive  sessions  m  xl 
fori=l;Dte 

[dummy , dummy, dummy , Dou^ut0)l=ad^tzzy(output_mcan4i^ut_mean,. . . 

innut  widtiLDtestrL:).-l,step); 


squared  cmor(epoch+l,3:4Hmean((l.Noutput).^2)mean((Doutput+l).^2)l; 
record(<i^h+l,3:4HOength(find(Noutput>0)yNte)Oength(find(I>>utput<0))^)]^ 

[Dte,dummy]=^ze(Dtest2);  %  Dte  « total  #  of  deceptive  sessions  in  x2 

clear  Doutput; 

fori»l:Dte 

[duinmy,duinmy,dummyJ)output<5)h«daptz2y(output_meaivnput_mcan,... 

input_wid^Dtest2Ci,0,- 1  »step); 


squared_ciroi<epoch+1.5:6Hinean((I-Noutput).^2)mcan((Dou4)ut+l).''2)l; 

recordCepoclH- 1 ,5:6H(length(find(Noutput>0)yNte)  aength(find(Doutput<0)yDte)l; 
[Dte,dummyHi2ei)tesl3);  S  Dte  -  total  #  of  deceptive  sessions  in  x3 
clear  Doutput; 
fori=l:Dte 

[dummy,dummy,dummyJ>>utput(i)l=ad^tzzy(output_mean,mput^mean,... 

input_width,Dtesl3(i,t),- 1  ^tep); 


8quared_error(epoch+  l,7:8)=Imean((I  .Noutput).^2)  mean((Doutput+ 1),''2)]; 
iecord(cpoch+ 1 ,7:8)=[(len^(find(Noutput>0)yNte)  (length(find(Doutput<0)yDte)}; 


^tfl:irainingpclpapc3:\n’); 

disp(record(epoch+ 1 ,:)) 


•/•  Start  training  and  testing 
fyrintfCresults  after  trainingNn') 
while  epoch<100000 
epoch=epoch+ 1 
clear  Noutput; 
clear  Doutput; 

Vo  Training 
fori=l:Ntr 

[output_ineajynput_mean,input_width,Noutput(i)^.. . 

ffHapt77y<output_meanjnput  meanjnput  width,  . 
KtrainCi,:),l;5tep); 

(output_mcan4nput_mean,input_width,Doutput0]=. . . 

adaptzzy(output_mean,input_roeaiijli^ut_width,.  .. 

Dtrain(i,:),-l,step); 

end 

%  end  one  epoch 
%  Test  training  data 
for  i=l:Ntr 

[dummy  ,dummy,dummy,NoutputCi)J=- 

adaptzzy(output_mean4nput_mean4nput_width,. . . 

Ntrain(i,;),l,step); 

[dummy,durTuny, dummy  ,DoutputO)}=... 

d^ptzzy(ou^ut_meanJlnput_inean4Tq)ut_width,. . . 

DtrainCi,:i’l^tep); 

end 

%  Record  results  of  training  data  at  the  end  of  an  epoch 

squared_CTTor(epoch+l.l:2Hn'ean((l*Nou^'i*)/'2)mean((Doutput+l).''2)]; 

iecord(epoch+l.l:2>=[Oength(6nd(NouQ)ut>0)yNtr)  (length(find(r)output<0)yDtr)  J; 


%  Now  test  testing  data 
clear  Nouqjut; 
clear  Doutput; 

[Nte,dummy}=”size(Ntcst); 

fori=l:Nte 

[dummy,dummy , dummy , Nou^ut©Hidaptzzy(ou^ut_mean,ii^ut_mean,. . . 

nq)ut_width,NtcstCi,:),Mtep); 


end 

[Dte,dummy)=si2e(Dtest); 

fOTF=l:Dte 

[dmtuny,dununy.duinmy,DoutputCO]=adaptzzy(output_mean,nM?ut_mean,... 

input^widtfiJ^testOi,:),- 1  ,stcp); 


•quaied  erroi<epoch+l,3:4Hmean<(l.Noutput).''2)  mean((Dou^ut+l).''2)]; 
record(epoch+U:4HO«ngfti(find(Nou^ut>0))/Nte)  Oength(find(Doutput<0)yDte)  ]; 

[Dtc,dummyHizc(Dtest2);  %  Dte  -  total  #  of  deceptive  sessions  m  x2 

clear  Doutput; 

fori®l;Dte 

[dummy,duinmy,duinmyJ>ou4>ut(i))=Bdaptzzy(output_mean,mput_mcan,... 

xnpul_widfti,Dtest2Cu:),*Utep); 


squared  enor<cpoch+l,5:6Hmean((l.Noutput).''2)  mean((Doutput+l).''2)l; 

record(epoch+l,5:6HOength(find(Noutput>0)yNte)0«3igth(find(Doutput<0)yDte)l; 

[Dte,dummyHi2e<Dtest3);  %  Dte  -  total  #  of  deceptive  sessions  in  x3 

clear  Doutput; 

forf=l:Dtc 

[duiTuny,duinmy,durTmiy4>>utput0)Hdaptzzy(ou^tiLin<»iwut_mean,... 

input_wid^Dtest30,0,’  1  iStep); 


squared  crroi<epoch+l,7:8Hmean((l'Noutput).''2)  mean((Doutput+l)'^2)l; 
xecord(^h+ 1 ,7;8H(l«ngth(find(Noutput>0)yNtc)  Gength(6nd(Doutput<0)yDte)); 

fprint^^ainingpc  1 
(iisp(record(epoch+ 1 ,:)) 

end  %  Go  to  next  epoch 


%  Experimenting  with  die  use  of  adaptive  fuzzy  logic 
%  in  polygraph  classification. 

fortiiat=l;l 

%  Initialize  die  parameters  for  fuzzy  LMS  algorithm. 

%  Ouqmt  of  1  means  nondeceptive 
%  Ou^nit  of-1  means  deceptive 
%  lcr^(output_mean)  =  #  of  rules 
§mnriCinitialmng\n'); 
outpiJt_mean={  1  1  -1  •1}'; 

%  input_mean=(  centers  of  first  rule ;  centers  of  second  rule ;  etc. ); 
input_iDean=[  -1  -0.5 ;  0  -0.25 ;  0  0 ;  1  0.3  ]; 

%  in^_width=I  widths  of  first  rule ;  widths  of  second  rule ;  etc.  ]; 
inpot_width=(  0.5  0,8 ;  0.5  0.25 ;  0.1  0.2 ;  0.6  0.5  ]; 

featuies={ 45 1  452] ;  %  Select  the  features 

stcp=0.005;  %  Select  learning  rate 

traineis=l  0,  %  Select  #  of  training  samples  fiom  each  category 

%  Select  training  data 
temp_n=Tandperm(50); 
temp  d=50+randpeTm(50); 

ndcp  3=[1:5  7;I0  12  13  15  16  18:20  22  23  25  26  28  29  31  32  34  35  37  38  40  41  43  44  46:49]; 
dcp_i={51  54  57  60  64  67  70  73  76  79  82  85];%  Deceptive  sessions  in  x3  for  training 
ndcp  2=1]; 

dcpj={51  53  56  59  62  65  68  71  74  78  81  84]; 
ndcp  1=11; 

dcp  1=151  54  57  59  62  65  68  71  74  77  80  83]; 

%  Note  diat  nondeceptive  data  in  x  1 ,  x2,  and  x3 
%  are  the  same,  so  ndcp_2  and  ndcp_l  are  really 
%  redundant. 

loadx3; 

loadx2; 

loadxl; 

Ntrain=lxKfeatuTesjidcp_l)  x2(featuresjidcp_2)  x3(featuresjidcp_3)]'; 
Dtram=lxl(features,dcp_T)  x2(fcatures,dcp_2)  x3(features,dcp_3)y; 

%  Sdect  testing  data 

ndcp  3=16  1 1  14  1721  24  27  30 33 36 39 42 45 50]; 

dcp_3=152  53  55  56  58  59  61:63  65  66  68  69  71  72  74  75  77  78  80  81  83  84  86:100]; 
ndcp  2=1]; 

dcp_H52  54  55  57  58  60  61  63  64  66  67  69  70  72  73  75:77  79  80  82  83  85:100]; 
ndcp  1={]; 

dcpj=152  53  55  56  58  60  61  63  64  66  6769  70  72  73  75  76  78  79  81  82  84:100]; 

%  Note  that  nondeceptive  data  in  xl,  x2,  and  x3 
%  are  the  same,  so  ndcp_2  and  ndcp^l  are  really 
%  redundant. 

NtcsHxl(features,ndcp_l)  x2(fcatures,ndcp_2)  x3(features,ndcp_3)]’, 

DtesHxl(features,dcp_r)  x2(features,dcp_2)  x3(features,dcp_3)]*; 

clear  xl; 

clear  x2; 

clear  x3; 

clear  record; 

clear  temp_n; 

clear  temped; 

epoch=0; 


%  Test  fuzzy  system  before  any  training 
%  Test  training  data  first 
clear  Noutput; 
clear  Dou^ut, 

lNti,dummyHize(Ntram);  %  Ntr  » total  #  of  nondeceptive  sessions 
lDti;dmnmy)==«ize(Dtrain);  %  Dtr  -  total  #  of  deceptive  sessions 

errorCNuraber  of  nondeceptive  and  deceptive  training  data  mismatch'); 
end 

focM;Ntr 

ldummy,dummy,dummy,Noutput(i)]*8daptzzy(output_mean4rput_mean,.. 

iTpul_wid^NtrainCi,0»  ^ 

lduinmy,dummy,dinnmy,Doutput(i)^daptzzy(output_mean,irput_mean,.. 

in|nit_wid^DtrainO,:),- 1  ^cp); 


end 

%%  ^nintfCResults  of  training  data  before  trainingNn'); 

•/•%  Noutput 
%%Doutput 

%  Record  results  v , 

recofd(epoch+l,l:2HO«”8^fi^^®^^*P^)^*^)  Oengtf»(find(Doutput<0)yDtr)  ]; 
§>rintfi:percent  correct  nondeceptive  and  deceptive  detections  for  training  data:\n'); 


disp(rccord(epoch+ 1,1:2)) 


%  Now  test  testmg  data 
clear  Nou^t, 
clear  Doulput; 

tNte,dunimyHi2e(Ntest);  %  Nte  » total  M  of  nondccqjtivc  sessions 
foTFlJ'fte 

[dummy.dummy , dummy , Noutput(»)]=«daptzzy(output_niean4nput_mean,. . . 

ii^ut_width,Ntc^:),  1  ,step); 
end 

[Dte,dummy}=size(Dtest);  %  Dte  -  total  #  of  deceptive  sessions 
forHl:Dtc 

[dummy, dummy, dummy , Doutput(i)Hd^lzzy(o«tpuLniean,input_mean,. . . 

input_width,DtestC»,d*  1  .step); 
end 

if(Nte  — 0)A(Dtc  — 0) 

y*%  Qjrint^esults  of  testing  data  before  trainm^'); 

%%Nou!put 
%%  Doutput 
H  Record  results 

recoid(epoch+  U:4H(Ien«ttt(6nd(Nou^ut>0)yNte)  acngth(find(Doutput<0))^) ); 
fyrintfl^pocent  correct  nondeceptive  and  deceptive  detections  for  testing  data:\n'); 
isp(record(epoch+ 1 3:4)) 
end 


%  Start  training  and  testing 
§mntfl>esults  after  trainingNnO 
while  cpoch<50 
epoch»^epoch+l 
clear  NouQ>ut; 
clear  Doutput; 

%Traming 

fori=l:Ntr 

[output_mean4nput_mean4i^t_width,Noutput0)l=... 

adaptzzy(output_iivean4nput_meanJ[nput_width,. . . 
Nttain(i,:),l,step); 

[output_mcan4nput_roean,input_widthJ>>uq)ut®}=... 

aiiptzzy(ou^ut_mearunput_mean,input__width, . . . 
Dtrain(i,:),-l,slep); 
end 

%  end  one  epoch 
%  Test  training  data 
for  i=l:Ntr 

(dummy,dummy,duirutiy3IoutputCi)]==. . . 

adaptzzy(output_incan,input_mean4nput_width,. .. 
NtiainC»,:),l,step); 

[dummy  ,dummy,dummy, Doutpuh5)l=.” 

tdaptz7y(oulput_meajunput__meanjnput_width,. . . 
DtramCi,:),-l,slep); 
end 

%%  §)Tintfl;'results  of  training  dataVn*) 

%%  Noutput 
%%  Doutput 

%  Record  results  of  training  data  at  the  end  of  an  epoch 

record(epoch+ 1 ,1 :2)=((length(find(Noutput>0))/Nlr)  Oength(find(Dou^ut<0)>T)tr)  J; 
fyrintfCpercent  correct  nondeceptive  and  deceptive  detections  for  training  data:\n') 
disp(reoord(epoch+l,l  ;2)) 

if(Nte-^0)&(Dte"^0) 

%  Now  test  testing  data 
clear  Noutput; 
clear  Dou^ut; 
forp=l:Nte 

[duinmy,duinmy,durnmy3Joutput(i)}=adaptzzy(ou^_mean4nput_meaa ... 

ir^)ut_wid^Nte^:),  1 ,8tep); 
end 

fbrM:D(e 

[dummy,dummy,dummy J)ou^t0))=^daptay(output_mcan4nput_roean,. .. 

input_width,Dte^:),- 1  ;5tep); 
end 

%%  fyrintfl^esults  of  testirtg  data\n') 

%%  Noutput 
%%  Doutput 

iecord(epoch+13:4H(tength(find(Nou^ut>0)yNte)  (kn^find(DoutpuKO)yDte)  ]; 
fyrintfi^’percent  correct  nondeceptive  and  deceptive  detections  for  testing  data:\n') 
disp(recoid(cpoch+ 1 3:4)) 
end 

end  %  Go  to  next  epoch 

maximum(trial)=max(record(;3>+tecord(:,4)); 

tempKfind((iecord(:3)^record(:,4)>==*maximum(triaI)y  00000]; 

maxin»(tii^  1 :5>=temp(  1 :5); 

maxima(trial,l 

inaximum/2 

end  %  Go  to  next  trial 

inaximum*^aximum/2 


EPILOGUE  -  Motivation,  challenges  and  risks 


I  was  easily  fascinated  by  the  idea  of  a  lie-detector  at  the  very  first  moment  I  heard  about  it.  I 
thought,  ’we  are  not  supposed  to  lie  anyway  and  a  lie-detector  can  help  us  find  and  prevent  a 
major  part  of  the  crimes  committed  in  our  society.  I  became  even  more  motivated  to  do  this 
research  by  an  innovative  way  of pattern  recognition,  namely  the  fuzzy  approach. 

But  very  soon,  1  also  began  to  realize  its  danger  -  while  juggling  with  numerical  data  and  being 
far  from  the  reality  of  testing  actual  human  beings  and  judging  them  by  an  algorithm. 

An  example:  Too  'good'  detection  rates! 

In  my  project,  I  obtained  in  certain  cases  up  to  97%  correct  detection  rate.  That  is,  indeed,  an 
impressive  number.  However,  the  emphasis  lies  on  "certain  cases" -not  only  in  this  thesis. 

A  non-technically  oriented  user  of  such  a  product  is  tempted  to  put  too  much  trust  into  these 
kinds  of  high  rates.  Even  if  we  have  a  stable  lie-detector  with  99%(!)  correct  detection,  this  still 
means  that  one  out  of  100  persons  will  be  judged  incorrectly. 

In  our  daily  life,  we  do  not  have  the  natural  skill  to  "see"  who  is  deceptive,  but  some  biological 
and  psychological  features  that  enable  us  to  estimate  whether  and  to  what  degree  someone  is 
lying.  This  is  exactly  what  I  have  exploited  in  this  project.  In  fact,  even  the  fuzzy  approach  is 
similar  to  the  human  way  of  categorizing  someone's  deceptiveness  in  soft  terms  like  "She  lies 
seldom"  or  "He  is  often  deceptive",  instead  of  hard  labeling  like  "She  is  truthful"  or  "He  is 
deceptive". 

After  all,  I  am  convinced  that  no  lie-detector  -  even  if  it  could  work  easily  with  different 
polygraph  formats,  and  is  perfect  in  technical  terms  -  can  ever  be  constructed  with  such  a  high 
detection  rate^^  that  one  could  judge  a  person  without  any  witnesses  or  other  additional 
inquiries.  We  may  only  use  a  lie-detector  as  a  helpful  "objective"  tool,  but  never  as  an  ultimate 
decision  maker. 

My  initial  goal  was  to  be  aware  of  this  responsibilty  and  not  to  lose  the  global  perspective  while 
dealing  with  technical  details.  I  hope  I  have  accomplished  this. 

I  also  hope  for  an  environment  where  we  do  not  judge  people  who  hurt  us,  but  do  forgive  them. 
In  that  case,  we  ourselves  are  forgiven  too,  since  all  of  us  deserve  to  be  judged,  don't  we! 


Ramin  Djamschidi 
San  Jose,  September  1994. 


^^See  e.g.  chapter  43.  for  "Outlier  effect"  and  "Performance  limitations”. 
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Appendix  E:  Errors  in  the  “Relevant  Only”  Data 


Fuzzy  Pattern  Recognition  of  PolyGraph 


E 


12/19/95 


NON-DECEPTIVE  DATA 


_ KEY _ 

•standard;  CODE.Oll,  012, 013, 021  022,  023, 031, 032, 033 
••index;  enor  message  m  MATLAB  reads, 

>^process 

“Index  exceeds  matrix  dimensions. 

»ErTor  in=>c;\users\uUca\non\extractf.m 

on  line  48==>  start  =  beginCi)  +  30  .•times(first_channeU); 

»Error  tn=>c;\users\ulka\non\process.m 
on  line  6=>feature  =  extractffz,  feattire_Iist);" 

^readJ;  CODE.Olc,  .02c,  .03c,  .023,  .033,  .01 1.  .021,  .031,  .013 
confusing  as  to  how  to  READ3  these  files 

•••N/A;  discs  were  unable  to  be  processed 

'^^extra:  CODE.041,  .042,  .043  processed  as  t4 _ 


NEWS.XLS 


NON-DECEPTIVE  DATA 

j 

! 

I  I 

ERS 

SUB# 

CODE 

#  OF  FILES 

EXTRA  FILES 

ERRORS 

1 

1 

2 

$$EACOWO 

Standard* 

none 

none  • 

2 

1 

4 

$$EAD5LX 

standard 

none 

none 

3 

1 

6 

$$EANWKF 

13 

0.005 

none 

4 

1 

8 

$$EAOZD6 

standard 

none 

none 

5 

1 

9 

$$EAQWB9 

standard 

none 

none 

6 

1 

11 

$$EARKZ6 

standard 

none 

none 

7 

1 

12 

$$EARJS0 

standard 

none 

none 

8 

1 

13 

$$EA%KR9 

standard 

none 

index**  13 

9 

1 

15 

$$EA%H#L 

standard 

none 

none 

10 

1 

18 

$$EB21YL 

standard 

none 

none 

11 

1 

22 

$$EC4QN3 

standard 

none 

none 

12 

1 

26 

$$EC7N7X 

standard 

none 

none 

13 

1 

33 

SSECLMTU 

standard 

none 

none 

14 

1 

34 

$$ECMA%C 

standard 

none 

none 

15 

1 

35 

$$ECM7GX 

standard 

none 

none 

16 

1 

36 

$$ECMWB3 

standard 

none 

none 

17 

1 

40 

$$EC#G20 

standard 

none 

none 

18 

1 

43 

$$EC$OOF 

standard 

none 

none 

19 

1 

44 

$$ED805U 

standard 

none 

none 

20 

1 

45 

$$ED8LUI 

standard 

none 

none 

21 

1 

46 

$$ED9439 

9 

read3'' 

N/A*** 

22 

1 

47 

$$ED9TCX 

standard 

none 

none 

23 

1 

50 

$$EDBQR2 

standard 

none 

none 

24 

1 

53 

$$EDCZYZ 

12 

extra'^'^ 

none 

25 

1 

59 

$$EDPY4# 

standard 

none 

none 

26 

1 

60 

$$EDQCY9 

standard 

none 

none 

27 

1 

61 

$$EDQ28X 

standard 

none 

none 

28 

1 

62 

$$EDQOCF 

standard 

none 

index  11 

29 

1 

65 

SSEDRKGO 

standard 

none 

none 

30 

1 

66 

$$EDRMU# 

standard 

none 

none 

31 

2 

11a 

$$FZIMEU 

13 

.005,  extra 

index  11a 

2 

11b 

SSFZISQ# 

standard 

none 

none 

32 

2 

12 

$$FZIT4L 

standard 

none 

none 

33 

2 

14 

$$FZJ52# 

standard 

none 

index  11 

34 

2 

30 

$$FZZN1Y 

10 

0.005 

index  13 

35 

2 

32 

S$FZ#D6J 

10 

0.005 

none 

36 

2 

33 

$$FZ#0HX 

13 

.005,  extra 

div  by  zero  13 

37 

2 

35 

$$FZS3A& 

standard 

none 

none 

38 

2 

36 

S$F#8CY9 

11 

.005,.STR 

none 

39 

2 

38 

$$F#9FJL 

10 

0.005 

index  12, 13 

40 

2 

41 

$$F#B6SC 

standard 

none 

none 

41 

2 

42 

$$F#B6C# 

standard 

none 

none 

42 

2 

45 

$$F#NMDX 

standard 

none 

index  11 

43 

2 

47 

$$F#NHQT 

standard 

none 

none 

44 

2 

48 

$$F#&7GC 

standard 

none 

index  13 

45 

2 

51 

$$F#QJTF 

standard 

none 

none 

46 

2 

52 

$$F#S0KR 

standard 

none 

none 

Page  1 


NEWS.XLS 


ERS 

SUB# 

CODE 

#  OF  FILES 

EXTRA  FILES 

ERRORS 

47 

2 

53 

$$F#RRD5 

Standard 

none 

none 

48 

2 

54 

$$F#RYFR 

12 

extra 

index  t3 

49 

2 

55 

$$F#SALQ 

10 

0.005 

index  t3 

50 

2 

56 

$$F$C#2# 

standard 

none 

none 

51 

3 

2 

$$F$D%YR 

standard 

none 

none 

52 

3 

12 

$$F$I41X 

11 

.005, .STR 

none 

53 

3 

25a 

$$F$IUY0 

10 

0.005 

none 

3 

25b 

$$F$UI3X 

11 

.005,  .STR 

none 

54 

3 

31 

$$F$WNSF 

standard 

none 

none 

55 

3 

43 

$$F%51&G 

10 

.STR 

index  t1 

56 

3 

46 

$$F%5$UF 

standard 

none 

none 

57 

3 

49 

$$F%7K#0 

standard 

none 

none 

58 

3 

59 

$$F%JAK6 

standard 

1  none 

none 

DECEPTIVE  DATA 


KEY 


^Standard;  CODE.Ol  1, 012. 013, 021  022, 023, 031. 032, 033 
**  Index:  error  message  in  MATLAB  reads. 

»process 

"Index  exceeds  matrix  dimensions. 

»Error  in~>c;\users\ulka\non\extraciLm 

on  line  48=>  start  *  begin(i)  +  30  .*times(  fir5t_channeU); 

»Error  in— >c;\users\ulka\non\process.m 
on  line  6=>feature  =  extractftz,  feature_list);" 

(gformat;  files  were  unable  to  be  read.  Error  message  m  DOS  reads: 
>format  not  linked 
>abnorTnal  program  termination 

''^extra;  CODE.041,  .042,  .043  processed  as  t4 

^readJ;  CODE.Ol c,  .02c,  .03c,  .04c 

contusing  as  to  how  to  READ3  these  files 


to 


DECEPTIVE  DATA 


SUB# 


la 


1b 


1c 


5 


CODE 


$$G3#SGD 


$$EACLB6 


1$$G3$6HN 


$$EAN#XO 


SSEAOQXV 


|$$EB0289 


|$$EA%%MX 


$$EB2WE$ 


|$$EC77GI 


$$EC760R 


|$$ECIX9#_ 


|$$ECIVB0 


$$ECJHKO 


ISSECJVSI 


#  OF  FILES 


standard* 


standard 


standard 


standard 


standard 


standard 


standard 


standard 


standard 


11 


standard 


standard 


standard 


standard 


standard 


standard 


SSECLODC 


SSECXAPG 


l$$ECYCG0 


l$$EDDBUX 


itSliJSSSi 


SSEDQYMF 


$$EDR3Xl 


$$EDS3ZL 


$$FZ7GP# 


$$FZIMEU 


$$FZJ358 


$$FZL9ZR 


$$FZLBY& 


$$FZMWSH 


$$FZWQQC 


$$FZW5T# 


$$FZYCM& 


standard 


standard 


standard 


standard 


standard 


standard 


standard 


standard 


standard 


standard 


standard 


12 


standard 


standard 


standard 


standard 


standard 


standard 


standard 


standard 


standard 


17 


10 


10 


standard 


10 


10 


standard 


standard 


13 


EXTRA  FILES 


0.005 


0.005 


none 


0.005 


0.005 


none 


none 


extra,  .005 


ERRORS 


3a 


none 


index  t3 


format 


none 


none 


index  t1,t2 


index  t3 


ERS 

SUB# 

CODE 

#  OF  FILES 

EXTRA  FILES 

ERRORS 

2 

31 

$$FZ2R&C 

12 

extra 

index  t2 

2 

44 

$$F#NC4B 

standard 

none 

none 

2 

46 

$$F#NGH3 

10 

0.005 

none 

2 

49 

$$F#&KWF 

10 

0.005 

none 

2 

50 

$$F#PUDW 

standard 

none 

none 

3 

14 

$$F$IK&0 

standard 

none 

none 

3 

16 

$$r$RJK6 

standard 

none 

none 

3 

36 

$$F%3C19 

standard 

none 

none 

3 

40 

$$F%4&C9 

11 

none 

3 

41 

$$F%4V0U 

standard 

none 

none 

3 

54 

$$F%I45# 

11 

.005,  .STR 

index  t1 

3 

62 

$$F%L350 

standard 

none 

none 

3 

66 

S$F%LXJ& 

standard 

none 

none 

